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/This application is a divisional application of V S. application serial number 
00/309. 320. filed May 1 1, 1999, which is a divisional of U.S. application serial number 

^' 09/096, S7L filed June 12, 1998, issued November 2, 1999. as U.S. Patent No. 5. 976,528. 

which is a div isional of L'.S. application serial number 08/736,771, filed November 26, 1996, 

5 issued October 6, 1998, as U.S. Patent No. 3,817,497. U.S. application serial numbers 
09/309,320, 09/096,371. and 08/736,771 arc hei-cby expressly incoiporatcd by reference. 

FIELD OF THE INVENTION 

This invention relates to nucleic acid and amino acid sequences of a novel glutathione 
10 s-transferase and to the use of these sequences in the diagnosis, prevention, and treatment of 
cancer. 

BACKGROUND OF THE INVENTION 

The glutathione s-transferases (GST) are a ubiquitous family of enzymes with dual 
13 substrate specificities that perform impoilant biochemical functions of xenobiotic 

biotransformation and detoxification, drug metabolism, and protection of tissues against 
peroxidative damage. The basic reaction catalyzed by these enzymes is the conjugation of an 
electrophilc with reduced glutathione (GSFl). which results in cither activation or 
deactix ation/dctoxification ot the chemical. The absolute requirement for bniding reduced 
20 ( iSH tt> a \\ idc \ ancl\ of cheniicals necessitates a di \ ersttx m ( iS T structuivs in \ ai ious 

organisms and cell t\pcs. 

CiSTs are homodimeric or hetetxKli me nc protems localized m the cell cytosoL 1 he 
major isozvmes share common structural and catalytic properties and in man have been 
classified into tour maior classes. Alpha, Mu. Pi. and 1 beta. The two largest classes. Alpha 
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parlicular clcctrophilic substrates. Specific ainnu> acid residues uiihin CiS I s ha\e been 
identified as important for these hmdmii sites and for calalMic acti\it\. Residues 067. lY^S, 
DlOl. [:l()4, and R131 are important for the binding of (iSH (Lee. H-(^ et al. ( 1^)^)5) J. Biol. 
Cliem. 270: ^;9-l()9). Residues R 1 . v R2(). and R69 are important for the catal\tic acti\it\ of 
S GST (Stenberg G et al. ( 1 Wl ) Biochem. J. 274: 549-5^^). 

hi most cases, GSTs perform the beneficial function of deactix ation and detoxification 
of potentially mutagenic and carcmogenic chemicals. Howev er, in some cases their action is 
detrimental and results in actu ation \\ ith consequent mutagenic and carcinogenic effects. 
Some forms of rat and human GSTs are reliable preneoplastic markers that aid in the 

10 detection of carcinogenesis. Expression of human GSTs in bacterial strains, such as 

S a 1 rn on c ! ! a t y p h i m u n u m , used i ri t h c w oil k ri o \\ n A rri c s test i r rri u t a g e n i c 1 1 y , I'l a s li c I [)C d t C) 
establish the role of these enzymes in mutagenesis. Dihalomcthanes. which produce liver 
tumors in mice, arc believed to be activated by GST. This view is supported by the finding 
that dihalomcthanes arc more mutagenic in bacterial cells expressing human GST than in 

15 untransfectcd cells (Thicr, R. et ah (1993) Pioc. Natl. Acad. Sci. USA 90: 8567-80). The 
mutagenicity of ethylene dibromide and ethylene dichloridc is increased in bacterial cells 
expressing the human Alpha GST, A 1-1, while the mutagenicity of aflatoxin 81 is 
substantially reduced by enhancing the expression of GST (Simula, T.P. et al. (1993) 
Carcinogenesis 14: 1371-6). Thus, control of GST activity may be useful in the control of 

20 mutagenesis and carcinogenesis. 

GST has been implicated in the acquired resistance of many cancers to drug treatment, 
the phenomenon known as multi-drug resistance (MDR). MDR occurs when a cancer 
patient is treateil w ith a cxtotoxic drug such as c\ cloj^hosplianiiLle anil subse(.|uent 1 \ becomes 
I'cs] slant to this clrug aiul to a \ arict \ ot' iiUkm' c\ totox r- agents as well. IncreasCLi ( iS I lex el s 

2."^ arc associatCLl with some ot these di'ug rcsislanl cancers, aiKl it is hclie\ei.l that this increase 

occurs in response to the drug agent w hich is then deactiv ated b\ the GS T catal\/ed CiSH 
conjugation reaction. The increased GS T levels then protect the cancer cells from other 
cMotoxic ai:ents lor winch GS T has aftinilv. Increased levels of Al 1 in tumors has been 
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m treating MDR in cancer patients. 

I he Liisco\er\ oi polynucleotides encoding no\el glutathione s-tiansi erases and the 
molecules themselves, prov ides the means to iurther investigate the role of these en/v mes m 
chemical mutagenesis and carcinogenesis and m drug resistance and cancer. In particular, the 
5 abihty to control GST activity may be usetul m the prevention of carcinogenesis where the 
risk of exposure to carcinogens or mutagens is high and in the prevention and treatment of 
MDR in cancerous tissues. 

SUMMARY OF THE INVENTION 

10 The present invention features a novel human glutathione s-transferase hereinafter 

designated HGST and characterized as having similarity to Aipha class GSTs from noimal 
human liver. pGTH2, (GI 825605), human hepatoma, Al-K (GI 259141), and mouse lung, 
GST 5.7, (GI 193710). Accordingly, the invention features a substantially purified HGST 
having the amino acid sequence, SEQ ID NO: 1. 

15 One aspect of the invention featuies isolated and substantially purified 

polynucleotides that encode HGST. In a particular aspect, the polynucleotide is the 
nucleotide sequence of SEQ ID NO:2. 

The invention also relates to a polynucleotide sequence eomprismg the complement of 
SEQ ID NO:2 or variants theieof. In addition, the invention features pc^lynucleotide 

20 sequences v\hich hybridize under stringent conditions to SEQ ID NO:2. 

The invention additionally featui-es nucleic acid sequences encoding polypeptides, 
oligonucleotides, peptide nucleic acids (PNA), fragments, portions or antisense molecules 
thereof, and expression vectors and host celts comprising pol\ nucleotides that encode I IGS I . 
I he prcseni invcmton also features antibodies \\ hicli bnid spccificallv to IKiS 1. and 

25 [pharmaceutical compositions comprising substantial 1\ purified IKiS T. The mvenlion also 

features the use of agonists and antagonists of HCjS1\ 
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I^RO sot'lwaiv (Hitachi Software LingincLMing C\>., Ltd., San Briinc), CA). 

lagLires 2A and 2B show the amino acid secjiience ahgnmenls amcMig HGS^r (SHQ) ID 
NO: I ), and Alpha class glutathione s-transferases from normal human li\er, pCi I'H2, (Cil 
825605: SHQ ID NO;3) , human hepatoma, Al-l. (GI 259141; SHQ ID NO:4), and mouse 
5 lung, GST 5.7, (GI 193710, SEQ ID NO:5). llie alignment was produced using the 

multisequence alignment program of DNASTAR software (DNAvSTAR Inc, Madison WI). 

Figure 3 shows the hydiophobicity plot (MACDNASIS PRO software) for HOST, 
SEQ ID NO: I; the positive X axis reflects amino acid position, and the negative Y axis, 
hydrophobicity. 

10 Figure 4 shows the hydrophobicity plot for pGTH2. SEQ ID NO:3. 

Figure 5 show's the hydrophobicity plot for A^l-l, SEQ ID N]0:4. 

Figure 6 shows the hydrophobicity plot for GST 5.7, SEQ ID NO:5 

Figure 7 shows the isoelectric point analysis for HGST, SEQ ID NO:l. 

Figure 8 shows the isoelectric point analysis for pGHT2, SEQ ID NO:3 
15 Figure 9 shows the isoelectric point analysis for Al-I, SEQ ID NO:4 



DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods arc described, it is 
20 understood that this invention is not limited to the particular methodology, protocols, cell 
lines, vectors, and reagents described as these may vaiy. It is also to be understood that the 
terminology used herein is for the puipose of describing particular embodiments only, and is 
not mtcnded to limn the scope of the present iincntuMi w inch uili be hmncd onl\ b\ the 
appeiuicd clai ms. 

25 It must he noted that as used herein and m the apjKMidcd clanns, ific singular forms 

'"a", "an", and "the" include plural reference unless the context clearly dictates otherwise. 
T hus, for examine, relerenLC lo "a host cell" includes a pluralitx of such host cells, reference 
to the "antibodx" is a reference to one or more aniiboelies and cciunalents thereof known to 




meanings as commonly understood by one of ordmaiA skill in the ail to w hich this unention 
belongs. Although an\ methods and materials similar or eqinv alent to those described herein 
can be used m the practice or testing of the present unention. the preterred methi)ds, de\ ices. 
and materials are now described. All publications mentioned herein are incorporated herein 
S bv reference for the purpose of describing and disclosmg the ceil lines, vectors, and 

methodologies u hich are reported in the publications which might be used in connectuMi with 
the invention. Nothing herein is to be ct^nstrucd as an admission that the invention is not 
entitled to antedate such disclosure by viitue of prior invention. 

10 I)i:fimtions 

^'Nucleic acid sequence" as used herein refers to an oligonucleotide, nucleotide, or 
polynucleotide, and fragments or portions thereof, and to DN A or RNA of genomic or 
synthetic origin which may be single- or double-stranded, and represent the sense or antisense 
strand. Similarly, "amino acid sequence'' as used herein refers to an oligopeptide, peptide, 
15 polypeptide, or protein sequence, and fragments or portions thereof, and to naturally 
occurring or synthetic molecules. 

Where "amino acid sequence" is recited herein to refer to an amino acid sequence of a 
naturally occurring protein molecule, "amino acid sequence" and like terms, such as 
"polypeptide" or "protein" are not meant to limit the amino acid sequence to the complete, 
20 native amino acid sequence associated with the recited protein molecule. 

"Peptide nucleic acid", as used herein, refers to a molecule which comprises an 
oligc^mer to w hich an amino acid residue, such as lysine, and an amino gnnip have been 
added, 'fhese small molecules, also designated anti-gene agents, stop transcript elongation b\ 
binding to their complcmcntai ) sUand ot nucleic acid ( Nielsen. P I:, el al. ( 1 ) Anticancer 
2S Drug Des. 8:53-b3). 

MCjS1\ as used herein, refers t(^ the amino acid seciuences of substantially purified 
IK iS r obtained Irom an\ species, particulaiiv mammalian, including bcn ine. o\ ine. porcine, 
murine, ccjume, aiul prelerablv human, ivoni an\ source whether natural, sxnthctic. 
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resctiucnccd to rcsol\c Lincallcd bases, ov w hich has been extended using XL-F^(/R (Perkm 
Himer, Norwalk, C'V) m the 5" and/or the 3' direction and reseqiienced. or which has been 
assembled troni the overhipping sec[Liences ot" more than one hicxte clone using the 
CjHLVIHW fragment assembly system (GCG, Madison, VVI), or w hich has been both 
5 extended and assembled. 

A "variant" of HGS^r, as used herein, refers to an amino acid sequence that is altered 
by one or more amino acids. The variant may have "conservative" changes, wherein a 
substituted amino acid has similar stiaictural or chemical properties, e.g., replacement of 
leucine with isoleueine. More rarely, a variant may have "nonconservative" changes, e.g., 
10 replacement of a glycine w ith a tryptophan. Similar minor variations may also include amino 
acid deletions or insertions, or both. Guidance in ueLermining winch amino acid residues may 
be substituted, inserted, or deleted without abolishing biological or immunological activity 
may be found using computer piogiams well known in the art, for example, DNASTAR 
softw are. 

13 A "deletion", as used herein, refers to a change in either amino acid or nucleotide 

sequence in which one or more amino acid or nucleotide residues, respectively, are absent. 

An "insertion" or "addition", as used herein, refers to a change in an amino acid or 
nucleotide sequence resulting in the addition of one or more amino acid or nucleotide 
residues, respectively, as compared to the naturally occunmg molecule. 
20 A "substitution", as used herein, refers to the replacement of one or more amino acids 

or nucleotides by different amino acids or nucleotides, respectively. 

The term "biologically active", as used herein, refers to a protein having structural, 
regulator) . or biochemical functions of a naturalK occurring mc^lecule. Likewise, 
"tmmunologicallN active" refers to ihc capabihtv of the nalural. recombinant, oi sMilhelic 
2."^ IIGS I , or an\ ohgopeptide thereot, to induce a specific immune response m a[ipiopiKae 

animals or cells and to bind w ith specific antibodies. 

l^he term "agonist", as used herein, reters to a molecule w hich, w hen bound to IIGST, 
causes a cliange in 1 1( iS 1 w hich n:iOLlulatcs the acti\ it\ of 1 ICiS 1. .Agonists ma\ include 
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biHind to HGS r, blocks ov modulates the biological or immunological acti\ ity of HCiS'T. 
Antagonists and inhibitors ma> include proteins, nucleic acids, carbohydrates, or any other 
molecules which bind to HCiS r. 

Vhc term "modulate", as used herein, refers to a change or an alteration in the 
5 biological activity of HGST. Modulation may be an increase or a decrease in protein activity, 
a change in binding characteristics, or any other change in the biological, junctional or 
immunological properties of HGST. 

The term ''mimetic", as used herein, refers to a molecule, the structure of which is 
developed from knowledge of the structure of HGST or portions thereof and, as such, is able 
10 to effect some or all of the actions of GSH-like molecules. 

Tlic term ''derivative , as used herein, reicrs io the chemical modification ot a nucleic 
acid encoding HGST or the encoded HGST. Illustrative of such modifications wa^uld be 
replacement of hydrogen by an alkyl, acyl, or amino group. A nucleic acid derivative would 
encode a polypeptide which retains essential biological characteristics of the natural 
15 molecule. 

The term ^'substantially purified'\ as used herein, refers to nucleic or amino acid 
sequences that are removed from their natural environment, isolated or separated, and are at 
least 60% free, preferably 75^7r free, and most preferably 907c free from other components 
w ith which they dvc naturally associated. 
20 ''Amplification'' as used herein refers to the production of additional copies of a 

nucleic acid sequence and is generally carried out using polymerase chain leaction (PCR) 
technologies well known in the art (Dieffenbach. CAV. and G.S. Dveksler (1995) PCR 
Primer, a Laboratory Manual , C\>ld Sj^rmg Harbor I^ress, Plaiin iew . N>'). 

The term "h\ bndi/ation". as used herein, rcfeis lo an\ process b\ which a sliand o\ 
25 nucleic acid buids with a complemcntar> strand through base pairing. 

The term "h}bridi/ation complex", as used herein, refers lo a complex formed 
between two nucleic acid secjuences by \ irtue of the formation of hydrogen bonds between 
complementar\ G and C" bases and between complcmentar\ A and \ bases: these h\drogen 
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complex ma> he lormed in solution (e.g., C',A or R,,t anat\sis) or between e>ne nucleic acid 
sequence present \u solution and another nucleic acid sequence immobili/ed on a solid 
support (e.g., membranes, filters, chips, pins or glass slides to w hich cells ha\e been tixed tor 
in situ hybridization). 

S The terms "complementary'' or "complementarity", as used herein, reler to the natural 

binding of polynucleotides under permissix e salt and temperature conditions by base-pairing. 
FH>r example, lor the sequence 'V\-G-T'' binds to the complementary sequence "T-C-A". 
Complementarity between two single-stranded molecules may be "partial", in which only 
some of the nucleic acids bind, or it may be complete when total complementarity exists 

10 between the single stranded molecules. The degree of complementarity between nucleic acid 
si I anus lias siginiiCant cttccts On thc ctlicicncy and Strength Oi hybndization bctwccM nucleic 
acid strands. This is of particular importance in amplification reactions, w hich depend upon 
binding between nucleic acids strands. 

The term "homo]ogy'\ as used herein, refers to a degree of complementarity. There 

15 may be partial homology or complete homology (i.e., identity). A partially complementary 
sequence is one that at least partially inhibits an identical sequence from hybridizing to a 
target nucleic acid; it is refeiTed to using the functional term "substantially homologous.'' 
The inhibition of hybridization of the completely complementary sequence to the target 
sequence may be examined using a hybndization assay (Southern or northern blot, solution 

20 hybridization and the like) under conditions of low stringency. A substantially homologous 
sequence or probe will compete for and inhibit the binding (i.e., the hybridization) of a 
completely homologous sequence or probe to the target sequence under conditions of low 
stringency This is not to sa\ that conditions ot low stringenc\ are such that n{)n-spccific 
binding is permittcel: low stringcncx conduions rcquiiv that thc binding ot two sccjucnccs to 

2."^ one another be a specific (i.e.. selective) interaction. The absence of non-specific binding 

may be tested by the use c^f a sec(Mui target secjuence w hich lacks even a partial degree of 
complcmentant\ (e.g.. less than about MY i identit\ ): m the absence of non-specific binding, 
thc probe will not h\bndi/e to thc second non-com|Memenlar\ target scciuencc. 
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base coinposilion ) of the sequence, nature of the target (DNA, RNA. base eomposilu>n, 
presence ni solutuMi or imniobi li/ation, etc.), and tiie concentration of the salts and other 
components (e.g.. the presence or absence of forniamide, dextran sulfate and/cM' polyeth\lene 
glvcol) are considered and the hybridization solution nia\ be varied to generate conditions of 
5 either low or high stringency different from, but equivalent to, the above listed conditions. 

1 he term ''stringent conditions'\ as used herein, is the '^strmgency" w hich occurs 
withm a range from about Tm-5' C (S C below the melting temperature ( I m) of the probe) to 
about 20' C to 25 C below Tm. As will be understood by those of skill in the art, the 
stringency of hybridization may be altered in order to identify or detect identical or related 
10 polynucleotide sequences. 

incLeiHi aniiseiisc , as u^cu iici ei ii, icici ^ iiuv^ luimuc ^t;qLIt;lHw^^^^ wun^iiun^ 

complementary to a specific DNA or RNA sequence. The term "antisense strand'^ is used in 
reference to a nucleic acid strand that is complementary to the "sense^' strand. Antisensc 
molecules may be produced by any method, including synthesis by ligating the gene(s) of 

15 interest in a reverse orientation to a vital promoter which permits the synthesis of a 

complementary strand. Once introduced into a cell, this transcribed strand combines with 
natural sequences produced by the cell to form duplexes. These duplexes then block either 
the further transcription or translation. In this manner, mutant phenotypes may be generated. 
The designation ''negative" is sometimes used in reference to the antiscnse strand, and 

20 ''positive" is sometimes used in reference to the sense strand. 

The term "portion", as used herein, with regard to a pix)tem (as in "a poilton of a given 
protein") refers to fragments of that protein. The fragments may range in size from four 
ammo acid residues to the entire amino acid sequence muius one amino acuL Thus, a protein 
"comiM ismg at least a portion of the amino acid sct|ucncc ot Sl 'O ID N( ): ! " encompasses the 

25 full-length human IKIS I and Iragments thcivot. 

" I ransformation", as defmed herein, describes a process by w hich exogenous DNA 
enters and changes a recipient cell. It ma> occur under natural or artificial c(Miditions using 
\arious methods well known m the art. I ranslormat ion umw rel\ on an\ known method tor 
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limited to. \iral mlcction, clcctropcM ation. Iipotection. and particle bombardment. Such 
"transtormed" cells incltide stabK transformed ceils m which the inserted DNA is capable of 
replication either as an aulonomoiisK replicating plasniid or as part of the host chromosome. 
I hey also inclnde cells which transiently express the inserted DNa\ or RNA for limited 
5 periods of time. 

The term '"antigenic determinant"\ as used herein, refers to that portion of a molecttle 
that makes contact with a particular antibody (i.e., an epitope). When a protein or fragment 
of a protein is used to immuni/c a host animal, numerous regions of the protein may induce 
the production of antibodies which bind specifically to a given region or three-dimensional 

10 structure on the protein: these regions or structures are refeired to as antigenic determinants. 
An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune response) for binding to an antibody. 

The terms ''specific binding" or ''specifically binding", as used herein, in reference to 
the interaction of an antibody and a piotein or peptide, mean that the interaction is dependent 

15 upon the presence of a particular structure (i.e., the antigenic determinant or epitope) on the 
protein: in other words, the antibody is recognizing and binding to a specific protein structure 
rather than to proteins in general. For example, if an antibody is specific for epitope "A", the 
presence of a protein containing epitope A (or free, unlabeled A) in a reaction containing 
lal')elcd "A" and the antibody will reduce the amount of labeled A bound to the antibody. 

20 The term "sample", as used herein, is used in its broadest sense. A biological sample 

suspected of containing nucleic acid encoding HGST or fragments thereof may comprise a 
cell- chromosomes isolated from a cell (e.g., a spread of metaphase chromosomes), genomic 
r)N/\ (m solution or bound to a solid support such as lor Southern analxsis), RNA (m 
solution or boLuid to a solid support such as for northern anal\sisL cDNA (in solution or 

23 bound to a solid suppoi t ). an extract (rom cells or a tissue, and tlie like. 

The term "correlates w ith expression of a polynucleotide", as used herem. indicates 
that the detection of the presence of ribonucleic acid that is similar t(^ S\]Q ID NO:2 by 
northern anal\sis is iiidicaUxc of the presence ot mRN.\ encoding IK iS T m a sample and 
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"Alterations" in the pol\ nucleotide of S\iQ ID NO: 2. as used herein, comprise an\ 
alteration m the seiiuence of pol\ nucleotides encoding HCiS T including deletions, insertions, 
and point mutations that may he detected using hybridization assays, hicluded within this 
definition is the detection of alterations to the genomic DNA sequence w hich encodes HGS T 
3 (e.g., by alterations in the pattern of restriction fragment length polymorphisms capable of 
hybndi/ing to SHQ ID NO:2), the inabihl\ of a selected fragment of SEQ ID NO: 2 to 
hyhridi/e to a sample of genomic DNA (e.g., using allele-specific oligonucleotide probes), 
and improper or unexpected hybridization, such as hybridization to a locus other than the 
normal chromosomal locus for the polynucleotide sequence encoding HGSl^ (e.g., using 

10 fluorescent m situ hybridization [FISH] to metaphasc chromosomes spreads), 

As used herein, ihe term aniibody'' refers to iniaci moiecuies as well as fragmenis 
thereof, such as Fab, F(ab')., and Fv, which are capable of binding the epitopic determinant. 
Antibodies that bind HOST polypeptides can be prepared using intact polypeptides or 
fragments containing small peptides of interest as the immunizing antigen. The polypeptide 

15 or peptide used to immunize an animal can be derived from the transition of RNA or 

synthesized chemically, and can be conjugated to a carrier protein, if desired. Commonly 
used carriers that are chemically coupled to peptides include bovine sci*um albumin and 
thyroglobulin. The coupled peptide is then used to immunize the animal (e.g., a mouse, a rat. 
or a rabbit). 

20 The term '^humanized antibody", as used herein, refers to antibody molecules in w hich 

amino acids have been replaced in the non-antigen binding regions in oi*dcr to more closely 
resemble a human antibody, while still retaining the original binding ability. 

1 IIK l\\ KM ION 

2.^ The iincntion i> based on the diNCo\ci \ of a no\cl human glutathione s-transt cra.sc, 

(HCiSD, the pol\ nucleotides encoding HCkS T, and the use of these compositions for the 
diagni^sis. prc\ention. or treatment of cancer. 

Nuctcic acids encoding the liuman 1 1( iS 1 ol the present iiuciilion were tirsi identified 




ID NC):2. w as dcnxcd from the follow ing ()\ ciiapping and/or extended nucleic acid 
sequences: Incyte Clones 1 333079/ Bl.Al) 11 n04, 1 32S546/ PANCNO TO?, 1422059/ 
KlDNNO ro^. and 2188683/ f^ROSNC) r26. 

In one embodiment, the invention enccMiipasses the novel human iilutathione 
S s-transterase. a polypeptide comprising the amino acid sequence of SHQ ID NO: 1. as shown 
m Figures 1 A, IB, and IC. HCjS7^ is 222 amino acids in length and has chemical and 
structural homology with two human Alpha CiSTs, pCj I H2 (GI 825603: SBQ ID NO:3), and 
Al-I (GI 2S914I), and a mouse Alpha GSM, GS r 3.7 (GI 193710). In particular, HGST 
shares 37^^r overall identity with each of the two human GS Fs and 399r identity with the 

10 mouse GST. In addition, various amino acid ixsidues found to be essential for the catalytic 
acLi vuy anu suosuaic oinuuig oi vjo is aic ctJiisci vcu iii nv^jo i aiiu iii luc i^niici uiicc vjo i 
molecules. These residues are: Y9, R13, R20, E32, 067, T68, R69, E97, DlOl, E104, and 
R131. Only residues E97 and E104 are not found in the mouse GST. E32 and E97 form salt 
bridges with R20 and R69, respectively, and these sail bridges or residues are thought to be 

13 important in structural stability of the GST molecule and may be important for catalysis. Y9 
is essential for catalysis by facilitating ionization of GSH. Residues Q67, T68, DlOl, E104, 
and R131 are important for the binding of GSH. As illustrated by Figures 3, 4, 3, and 6, 
HGST and the three Alpha GSTs have rather similar hydrophobicity plots. Figures 7, 8, and 
9 show the isoelectric point analyses for HGST, pGHT2, and Al-1. The pi values of 8.8, 9.0, 

20 and 9.3, respectively: fall within the range characteristic of Alpha GSTs. In addition to 
bladder tumor, partial transcripts of the cDNA encoding HGST are found in fetal tissues 
(kidney and pancreas) and in prostate tissue adjacent to prostate cancer. 

The 1 mention also encompasses IKiSf \ariants. A preferred IKiSf \anant is one 
ha\nig at least NO' V . and more preferabtx 90^7 , ammo acid sec|ucncc similantN {o ific IK IS I 

23 ammo acid sequence (SEQ ID NO: 1 ). A most prerciiCLl IKiS I \anant is one hav ing al least 

93^7 amino acid sequence similarity to Sl^Q ID NO: 1 . 

rhc 1 mention also encon^passes pol\ nucleotides w hich enctx^lc llCiS T. Accordingly, 
am nucleic acid sccjucncc w hich encodes the ammo acid sequence of llCiS 1 can be used to 
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N():2 as shown m l-igiircs 1 A, IB, and 1C\ 

II w ill be appreciated b\ those skdled in the art that as a result of the degenerae\ o\ the 
eenetie code, a nuihitude of nucleotide sec|uences enccKlmg IKiS 1\ some bearing minimal 
homolog\ to the nucleotide sequences of any know n and naturally occurring gene, may be 
5 produced. Thus, the m\ention contemplates each and every possible variation of nucleotide 
sequence thai could be made by selecting combinations based on possible codon choices. 
These combinations are made m accordance w ilh the standard triplet genetic code as applied 
to the nucleotide sequence of naturally occurring HGST, and all such variations arc to be 
considered as being specifically disclosed. 

10 Although nucleotide sequences which encode HGST and its variants are preferably 

1 1 i' 1 I 1 _ ^ ^ - ^1- - 1 - j„ . *i ♦..^,.11,, i_ir::cT^ . ,.^/i^,- 

capaoie oi uyuiiui/.iiig lu liic iiucujuiiue ^L;^ut;in^^^ ui naiuitinj m 1 1 115 iavjvji cim^v^i 

appropriately selected conditions of stringency, it may be advantageous to produce nucleotide 
sequences encoding HGST or its derivatives possessing a substantially different codon usage. 
Codons may be selected to increase the rate at which expression of the peptide occurs in a 

15 particular prokaryotic or eukaryotic host in accordance with the frequency with w hich 
pailicular codons are utilized by the host. Other reasons for substantially altering the 
nucleotide sequence encoding HGST and its derivatives without altering the encoded amino 
acid sequences include the production of RNA transcripts having more desirable properties, 
such as a greater halfTife, than tianscripts produced from the naturally occurring sequence. 

20 The invention also encompasses production of DNA sequences, or portions thereof, 

which encode HGST and its derivatives, entit-ely by synthetic chemistiy. After production, 
the svnthetic sequence may be inserted into any of the many available expression vectors and 
cell s\ stems using reagents that are well known m the art at the time of the filing of this 
application. Morcoxer, sxnthctic chennisu\ ma\ be used to iiitKKluce mutalions into a 

2^ sequence cncoiiing IK iS T or an> portion thcict)f. 

Also enc(mipassed by the inventuMi are poKnuclecnide secjuences that are capable ot 
h\bndi/mg to the claimed nucleotide sequences, and m particular, those shown in S\iQ ID 
N( ):2. imdcr \ ai ious conditions ot slnn.L:enc\ . 1 1\ bndi/ation conditions are bascil on the 




Methods Hn/\mi^L 152:^07-5 I I ). and nia\ he used at a defined stfingenex . 

Altered niicleie aeid sequences eneodnig HCiS 1^ w hieh are encompassed b\ the 
nn en I ion include deletions, insertions, or substitutions of dilTerent nucleotides resulting in a 
polynucleotide that encodes the same or a functionally equivalent MCiS T. The encoded 
5 protein may also ccmtain deletions, insertions, or substitutions of amino acid residues w hich 

produce a silent change and result in a functionally equivalent IIGS T. Deliberate amino acid 
substitutions may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as the 
biological activity of HGST is retained. For example, negatively charged amino acids may 

10 include aspartic acid and glutamic acid; positively charged ammo acids may include lysine 
and aiginine, and amino acids wiui uncharged polar head grcRij^s having similar 
hydrophilicity values may include leucine, isoleucine, and valine: glycine and alanine: 
asparagine and glulamine: serine and threonine: phenylalanine and tyrosine. 

Also included within the scope of the present invention are alleles of the genes 

15 encoding HGST. As used herein, an "allele" or ''allelic sequence" is an alternative form of 
the gene which may result from at least one mutation in the nucleic acid sequence. Alleles 
may result in altered mRNAs or polypeptides whose structuie or function may or may not be 
altered. Any given gene may have none, one, or many allelic forms. Common mutational 
changes which give rise to alleles arc generally ascribed to natural deletions, additions, or 

20 substitutions of nucleotides. E^ach of these types of changes may occur alone, or in 
combination with the others, one or more times in a given sequence. 

Methods for DNA sequencing which are well known and generally available in the art 
ma\ be used to practice an\ embodiments of the unention. Ttic methods ma\ cmpttiy such 
cn/\mes as the Klcnou fragment ot DNA poUnierasc I. S1{{^)1 ' 1 :N .\S1 (I S Biochemical 

2,"^ Gorp, Glcxclaiid, Oil), laci pol>meiasc ilVMkm Idmci ), ihcrmoslablc 17 polxincrasc 

(Amersham. (liicago. IL). t^- combinations c^f recombinant pol\ tnerases and proofreading 
e\(Miucleases such as the l^IX)N(iASl{ ampHlicatuMi system marketed by Gibco HRL 
( Gait lie rs burg. MD). Picleiablw the {ikkcss is automated wuh maclnncs sucli as the 
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1 he nucleic acid sequences encoding HCiS T may he extended ulih/iiii: a partial 
nuclecHide sequence and employing \ arious methods know n m the art to detect upstream 
sequences such as promcners and regulatoiA elements. For example, one method w hich ma\ 
he employed, "restrictunvsite" PC^R, uses uni\ersal primers to retriexe unknown seciuence 
S adjacent to a known locus (Sarkar, G. ( 1993) PCR Methods Applic. 2:3 18-322). hi 

particular, genomic DN A is first amplified in the presence of primer to linker sequence and a 
primer specific to the known region. Vhc amplified sequences are then subjected to a second 
round of PCR with the same linker primer and another specific primer internal to the first 
one. Products of each round of PCR are transct ibcd with an appropriate RNA polymerase 
10 and sequenced using reverse transcriptase. 

based on a known region (Triglia, T. et ah (1988) Nucleic Acids Res. 16:8186). The pnmcrs 
may be designed using OLIGO 4.06 primer analysis softw are (National Biosciences Inc., 
Plymouth, MN), or another appropriate program, to be 22-30 nucleotides in length, to have a 

1 5 GC content of 50% or more, and to anneal to the target sequence at temperatures about 

68''-72° C. The method uses several restriction enzymes to generate a suitable fragment in 
the known region of a gene. The fragment is then circularized by intramolecular ligation and 
used as a PCR template. 

Another method which may be used is capture PCR which involves PCR 

20 amplification of DNA fragments adjacent to a known sequence in human and yeast artificial 
chromosome DNA (Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:111-1 19). In this 
method, multiple restriction enzyme digestions and ligations may also be used to place an 
engineered double-stranded seciuence intt> an unknown portion of the DNA molecule betorc 
performing P(^R. 

2."^ Another method which ma\ be u>cd lo ictricvc unknown sciiucnces is that ot Parker. 

J.D. et al. ( 1991; Nucleic Acids Res. 19:30^3-3060). Additionally, one may use PCR, nested 
primers, and PROMCrritRMNDHR libraries to walk in gencMiiic DNA ((^ontech. Palo Altt>. 
CA l. I his jM'occss avoids the need to sciecn libraries and is uscliil m Imduig mtron/exon 
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si/e-sclcclcd to include larger cDN As. Also, random primed libraries are preferable, in that 
the\ uill coniam more sequences which ct>ntain the 3" regions of genes. Use of a randoml\ 
primed librar\ ma\ be especially preferable tor situations in which an oligo d{'I") lihrarv does 
not \ield a fulMength cDNA. Genomic libraries ma\ be useful for extension of sequence into 
3 the 5* and 3' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to 
analy/e the si/e or conHrm the nucleotide sequence of sequencing or PCR products, hi 
particular, capillary sequencmg may employ flowable polymers lor electrophoi-etic scpai-ation, 
four different fluorescent dyes (one for each nucleotide) which are laser activated, and 
10 detection of the emitted wavelengths by a charge coupled device camera. Output/light 

1 n t jTi ri 1 t \ / / 1-v/zi ^ ^ /-xr^ \ / f^>. ^^l/^^if^^^*.!.! . ,,,1 — — . _ / ✓'^ l" " X T r r^r^ 

and SEQUENCE NAVIGATOR, Pcrkin Elmer) and the entire process from loading of 
samples to computer analysis and electronic data display may be computer controlled. 
Capillary electrophoresis is especially preferable for the sequencmg of small pieces of DNA 
\5 which might be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments 
thereof which encode HGST, or fusion proteins or functional equivalents thereof, may be 
used in lecombinant DNA molecules to diiect expression of HGST in appropriate host cells. 
Due to the inherent degeneracy of the genetic code, other DNA sequences which encode 
20 substantially the same or a functionally equivalent amino acid sequence may be produced and 
these sequences may be used to clone and express HGST. 

As will be understood by those of skill in the art. it may be advantageous to produce 
I IGS f-cncoding nucleotide scc]uences possessing non -natural 1 \ t)ccurring codons. boi 
example, codons ]Trc!crrcd b\ a parliculai' [MokarNdlic or eukar\otic host can be selected to 
Micrcasc the rate of j^idtcm cxj^vssion or to produce a ivcombinant RNA UanscrijM ha\mi: 
desirable properties, such as a half-life w hich is h^iger than that of a transcript generated 
\ynm the naturally occurring sec|iience. 

I he nuclecMide scc|uciiccs ot the present invention can be engineered limiil! methods 




expression ot the gene product. DNA shntTHng by random IragmenlatuMi and PCR 
reassemhK ot gene tragn^enls and s\nthelie oligonLicleolidcs nui\ be used engnieer llie 
nucleotide secjuences. hor e\ani|^le, site-directed mutagenesis ma\ he used to insert new 
restriction sites, alter gi vcosn lation patterns, change codon pre te re nee, produce sp hce 
3 variants, or introduce mutations, and so forth. 

hi another embodiment of the invention, natinal. modified, or recombinant nucleic 
acid sequences encoding HGSI^ may be hgated to a heterologous sequence to encode a fusion 
protein. For example, to screen peptide hbrarics for inhibitors of HGST activity, it may be 
useful to encode a chimeric HGST protein thai can be rccogni/cd by a commercially available 

10 antibody. A fusion protein may also be engineered to contain a cleavage site located between 
the HGST eiicouiiig sequence and tlie lictcrologous protcni sequence, sc^ that HGST may be 
cleaved and purified away from the heterologous moiety. 

In another embodiment, sequences encoding HGST may be synthesized, m whole or 
in part, using chemical methods well known in the art (see Caruthers, M.H. et al. (1980) Nuel. 

15 Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) Nucl. Acids Res. Symp. Scr. 225-232). 
Alternatively, the protein itself may be produced using chemical methods to synthesize the 
amino acid sequence of HGST, c^^ a portion thereof. For example, peptide synthesis can be 
performed using various solid-phase techniques (Roberge, J.Y. et al. (1995) Science 
269:202-204) and automated synthesis may be achieved, for example, using the ABl 43 1 A 

20 peptide synthesizer (Perkin Elmer). 

The newly synthesized peptide may be substantially purified by preparative high 
performance liquid chromatography (e.g., Creighton, T. ( 1983) Proteins , Structures and 
Molecular Principles . \VH iMceman and (\).. New ^'ork, N^'). The comj^osilion of the 
svnthctic peptides ma\ be confirmed b\ amiiK) acid analysis or sCL]Licncmg (e.g.. ihc pAlman 

25 dcgiadation jMoceduic: C'rcighton. supra). Additionally the amnio acid sequence ol IIGS 1. 

or an\ part thereof, mav be altered dm'ing direct synthesis and/or combined using chemical 
methods \\ ith secjuences \rom other proteins, or an\ piwl theivof. to j^rculuce a v ariant 
polvpciMidc. 
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\cctor \\ hich contains ihc ncccssar\ clcmLMiis lor [he transcription and translation of the 
inserted coding sequence. 

Methods w hich are well know n to those skilled m the art ma> he used to construct 
expression vectors containing sequences encoding HGS F and appropriate transcriptional and 
5 translational control elements. 1 hese methods include m vitro recomhinant DNA techniques, 
synthetic techniques, and in vi\o genetic recombination. Such techniques are described in 
Sambrook, J. et ah ( 1989) Molecular dealing , A L.aboratory Manual , Cold Spring Harbor 
Press. Plamvievv, NY, and Ausubel, F.M. et al. ( 1989) Current Protocols m Molecular 
Biology , John Wiley & Sons, New York. NY. 

10 A variety of expression vector/host systems may be utilized to contam and express 

i_rr::cT^ t^u,^,,.. ; — u..* ♦ ™: : ,u .... 

o»^qij\^i iv^ V7\ai I li^ 1 ivj>^ 1 . 11 1 uv^ 1 Lna»^, i^LiL tiiv^ iii^L 1 1 1 1 II Lv^ia 1 1 J ici ^ui II ."5 1 1 1^ i ui^ 1 1 a:^ 

bacteria transformed w ith recombinant bacteriophage, plasmid, or cosmid DNA expression 
vectors; yeast transformed with yeast expiession vectors; insect cell systems infected with 
virus expression vectors (e.g., baculovirus); plant cell systems transformed with virus 

15 expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or 
with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal cell systems. 

The 'Yx)nti-ol elements" or "legulatory sequences" are those non-translated regions of 
the vector— enhancers, promoters, 5' and 3' untianslated legions— which interact with host 
cellular proteins to carry out transcription and translation. Such elements may vary in their 

20 strength and specificity. Depending on the vector system and host utilized, any number of 
suitable transcription and translation elements, including constitutive and inducible 
promoters, may be used. For example, when cloning in bacterial systems, inducible 
prtMiioters such as the h\brid lac/ promoter of the Bl A \iS( R\P \ phagemid (Stratagene. 1 .a 
Jolla. (W) or PSPOR 11 n\ asmi d ( C 1 1 bco B R P ) and the like nia\ be used. 1 he baculovirus 

2."~> ixi|\ficdiin promoter nia\ be used m nisect cells, [^romolcis oi cnhanceis dcrncd rrt>m the 

genomes of plant cells (e.g., heat shock. RUBISCO; and storage protein genes) or from plant 
\ iruses (e.g., \ iral promoters or leader secjuences) max be cloned into the xector. In 
inannnahan cell sxstems, promoters from mammalian genes ov from mammalian \iruscs arc 



IN 




selectable marker. 

In bacterial s\ stems, a number ot expression \ecti)rs ma> be selected depending upon 
the use intended lor MCjS r. For example, u hen large quantities of HGS T are needed \ov the 
induction of antibodies, \ectors w hich direct high le\el expression of fusion proteins that are 
S readiiy purified may be used. Such \ectors mclude, but are not limited to, the multifunctional 
H. coll cloning and expression \ectors such as BLIJ t^SCRIPT^ (Stratagene), in w hich the 
sequence encoding HGST may be ligated uito the vector in frame w ith sequences lor the 
amino-tcrminal Met and the subsequent 7 residues of B-galactosidase so that a hybrid protein 
is produced: pIN vectors (Van Heeke, G. and S.M. Schuster ( 1989) J. Biol. Chem. 

10 2M:55i)3-5509): and the like. pGEX \cetors (Promega. Madison, WI) may also be used to 
express lorcign poiypcptiucs as tusion proteins with giututhiorie S-transt erase (GST). In 
general, such fusion protems arc soluble and can easily be purified from lysed cells by 
adsoiption to glutathione-agarose beads followed by elulion in the presence of free 
glutathione. Proteins made in such systems may be designed to include heparin, thrombin, or 

15 factor XA protease cleavage sites so that the cloned polypeptide of interest can be released 
from the GST moiety at wmII. 

In the yeast, Saccharomyces cercvisiae , a number of vectors containing constitutive or 
inducible promoters such as Alpha factor, alcohol oxidase, and PGH may be used. For 
reviews, see Ausubel et al. (supra) and Grant et al. ( 1987) Methods Enzymol. 153:516-544. 

20 In cases where plant expression vectors are used, the expression of sequences 

encoding HGST may be driven by any of a number of promoters. For example, viiiil 
promoters such as the 35S and I9S promoters of CaMV may be used alone or in combinatuMi 
with the omega leader sequence ivom WW { i akamatsu, N. ( 1987) IiMBO .1, 6:307-31 I ). 
Aiicrnaln cl\ . planl promoters sucli as ihc small subunit of Rl BISCOoi lical shock 

25 promoters ma> be used ((\>ru//K (i. cl al. ( 19S4) FAIBO .1. 3: 1671 1680; Broglic, R. ct al. 

( 1984) Science 224:838-843: and Winter. ,1. et al. ( 1991 ) Results Probl. Gell Differ. 
17:85-105). These constructs can be introduced into plant cells b\ diiect f)NA 
iranstormalton oi' pathogen-mediated li ansicclion. Such lechnKjucs arc described m a 
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1^)1-1%). 

An insect s\sicm ma\ alst> be used lo exj^ress MCiS T. \'oy example, m tme such 
s\ stem, Autt)grapha ealitorniea nuclear pc>l\ hedrosis \ n us ( AcNP\' ) is used as a \ eclor to 
express rt)reign iienes in Spodoptera trugiperda cells or \n i'nehoplusia larvae. The 
S sequences encodmg HGST may be cloned into a non-essentuil region of the \ irus, such as the 
polvhedrin gene, and placed under control of the polyhedrin promoter. Successful insertion 
of HCiS^r w ill render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, tor example, S. frugiperda cells 
or Tnchoplusia larvae in which HGST may be expressed (Hngelhard. H.K. et al. ( 1994) Proc. 

10 Nat. Acad. Sci. 91:3224-3227). 

in mammaHan host cells, a number of viiai-based expression systems may be uiiii/ed. 
In cases vvheix an adenovirus is used as an expression vector, sequences encoding HGST may 
be ligated into an adenovirus transcription/translation complex consisting of the late promoter 
and tripartite leader sequence. Insertion in a non-essential El or E3 region of the vual 

13 genome may be used to obtain a viable virus which is capable ot expressing HGST in 

mfected host cells (Logan, J, and Shenk, T. (1984) Pioc. Nail. Acad. Sci. 8 1 :36SS-3659). In 
addition, transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be 
used to increase expression in mammalian host cells. 

Specific initiation signals may also he used to achieve more efficient translation of 

20 sequences encoding HGST. Such signals include the ATG initiation codon and adjacent 
sequences. In cases w here sequences encodmg HGS1\ its initiation codon, and upstream 
sequences are inserted into the appropriate expressuMi vector, no additional transcriptional or 
ti anslalional control signals ma\ be needed. llo\\c\ci . m cases w Ikmv only coding sec]ueiicc- 
oi a porticMi thereof, is inserteel, exogenous tidnslational conti'ol signals mcluding the A 1 ( i 

23 initKau>ii codon should be provided, laulhcrmorc, the initialion codon should be in the 

correct reading frame tt^ ensure translation of the entiie insert, lixogencnis iranslational 
elements ami initiatuMT codons ma\ be ot v arums origins, both natinal and syntheUc. The 
criiciencv of expression ma\ be cnhanccLl bv the inclusion of enhancers which are appropriate 
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In addition, a ht^si cell strain ma\ he chosen for its ahiht\ io modukite the expression 
of the inserted sequences or to process the expivssed protein m the desired lashion. Such 
modifications of the polypeptide inchide, hut are not limited to. acetxhition. carho\\ kition, 
gi\cosykitu)n, phosphoryhition, hpidation, and acykition. Post transkitional processing u hich 
5 cleaves a "prepro'' form of the protein may also be used to facilitate correct insertion, folding 
and/or function. Different host cells such as CHO, HeLa, MDCK, Ht:K293, and WI38. 
w hich have specific cellular machinery and chaiactenslic mechanisms for such 
post-translational activities, may be chosen to ensure the correct modification and processing 
of the foreign protein. 

10 For iong-lerm, high-yield pi^oduetion of recombinant proteins, stable cxptession is 

preferred. For example, cell nnes which siabiy express HCiST may be transformed using 
expression vectors which may contain viral oiigins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a sepai ate vector. 
Following the introduction of the vector, cells may be allowed to grow for 1-2 days in an 

15 enriched media before they are switched to selective media. The purpose of the selectable 
marker is to confer resistance to selection, and its presence allows growth and recovery of 
cells which successfully express the introduced sequences. Resistant clones of stably 
transformed cells may be proliferated using tissue culture techniques appropriate to the cell 
type. 

20 Any number of selection systems may be used to recover transformed cell lines. 

These include, but are not limited to. the heipes simplex virus thymidine kinase (Wigler, M. 
et al. (1977) Cell 1 1:223-32) and adenine phosphonbosyltransfeiase (Lowy, k et al. (1980) 
Cell 22:S17 23) genes w hich can be cmploxed m tk or aprt cells. rcspecti\ el\ . .Also, 
antimetabohtc- antibiotic i)v herbicide resistance can be used as tlic basis for selection: for 

25 example, dhti which contcis resistance to methotiexale (W igler. M. el ak i 1980) Proc. Natk 
Acad. Sci. 77:3567-70): npt, which ccMifers resistance to the aminoglycosides necMn\cm and 
(i 4 I S ((\>lhcre-('iarapin. \\ et al ( 198 I ) J. Vh^k Bu^k I 50: 1 - 14) and als or pat. w hich confer 
icMstancc to chlorstilluron and phosphinolricin accl\ Itranstcrasc. respectivcl\ iMuriA, supra). 




hi.slidmc (Hamnan. S.C\ and R.(\ Muiligan ( 1^)SS) Pvoc. Natl. Acad. Sci. N5:S()47-5I ). 
RcclmuK, the use o\ \isiblc markers has gamed popiilani) with siieii markers as aniluic) anms, 
l:> glLiCLironidase and its substrate (il'S, and lueiterase and its substrate lueitenn, being \\idel> 
used not onl\ to identity transtornKints, but also to ciuantitx the amount ol transient or stable 
5 prt)tein exj^ression attributable tt> a sjxxMfie vector s\stem (Rhodes. (\A. et al. ( 1^)^)3) 

Methods Mol. I^iol. 55:121-131). . 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, its presence and expression may need to be confu*med. For example, 
if the sequence encoding HCiS r is inserted \\ ithin a marker gene sequence. reecMiibmant cells 

10 containing sequences encodmg FKjST can he idenlil'ied by the absence ot marker gene 

i'unciion. AliernaLi vely, a marker gene can be placed in iandern vvilii a sequence encoding 
HGST under ihe contiol of a single piomoter. Expression of the marker gene in response to 
induction or selection usually indicates expression of the tandein gene as well. 

Alternalively, host cells which contain the nucleic acid sequence encoding HGST and 

15 express HGS f may be identified by a vaiiety of procedures known to those of skill in the art. 
I hese proceduies include, but are not limited to, DNA-DNA or DNA-RNA hybndt/ations 
and protein bioassay or immunoassay techniques which include membrane, solution, or chip 
based technologies for the detection and/or quantification of nucleic acid or protein. 

The presence of polynucleotide sequences encoding HGST can be detected by 

20 DNA-DNA or DNA-RNA hybridization or amplification using probes or portions or 

fragments c^f pc^lynucleotides encoding HGST. Nucleic acid amplificatu^n based assays 
involve the use of olig(Mmcleotides or oligomers based on the seciuences encoding HCiST to 
detect transformants contammg DN.A or RNA encoeling lIGS T. As used hcrem 
"oh iioiniclcot ulcs*" iU' "ohgomcrs'^ I'ctcr to a nucleic a^ail scciucncc ol at least about 10 

25 luiclcolidcs aiKl as main as about (>() luiclcolides, pictciablv about \ ^ to M) nucleotides. aiKi 

more preleniblv about 20-25 nucleotides, \\ hich can be used as a probe or amphmer. 

A V anctv of pnUocois for detecting and measuring the expression of 1 K ]S \\ using 
either polvcUMial or monoclonal antibodies specitic tor the prt>tein aiv known in the art. 




Lilili/ing moiKKUMKii antibodies rcactnc u> two non-intcrtcnng epitopes on HGS r is 
pre t erred, hut a competitive hi n dine assax may he emploxed. 1 hcse and other assa\s are 
deserihed, among other places, m Hampton, R. et ah ( 1*^)90; Serological Methods , a 
L.ahoratorv Maniiah APS Press, St Paul, MN) and Maddox, D.H. et ah ( I9S3; J. t:\p. Med. 
5 l?8:1211-1216). 

A wide \ anety oT lahels and conjugation techniques are know n by those skilled in the 
art and may be used in various nucleic acid and amino acid assays. Means lor producing 
labeled hybridi/alion or PGR probes for detecting sequences related to polynucleotides 
encoding HGST include oligolabeling. nick translation, end-labeling or PGR amplification 

10 using a labeled nucleotide. Alternatively, the sequences encoding HGS1\ or any portions 

thereoi in ay he cioiieu luio a vcctoi k)i ihc [production oi an luRNA [jio'dc. Such vcclois aie 
known in the art, are commercially available, and may be used to synthesize RNA probes m 
vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled 
nucleotides. These procedures may be conducted using a variety of commercially available 

15 kits (Pharmacia & Upjohn (Kalamazoo, MI); Promega (Madison WI); and U.S. Biochemical 
Gorp. (Cleveland, OH)). Suitable repoiler molecules or labels, which may be used, include 
radionuclides, enzymes, tluoiescenl, chemiluminescent, or chromogcnic agents as well as 
substrates, cofactois, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding HGST may be cultured 

20 under conditions suitable for the expiession and recovery of the protein from cell culture. 
The protein produced by a recombinant cell may be secreted or contained inti*acellularly 
depending on the sequence and/or the vector used. As w ill be understood by those of skill in 
the art, expiession \ eclors containing polynucleotides w hich encode HCiS T ma\ he designed 
to contain signal scejucnces w hicii diiect secretion ot 1 IGS 1 through a prokar\otic or 

25 cLikaiNotic cell memhiane. Olhci rccomhinant constructions ma\ be used to ii)in sCLjucnccs 

enccKling HCiS T to nucleotide sequence encoding a pc^lypeptide domain w hich will facilitate 
purification of soluble proteins. Such purification tacihtating domains include, f^ut are not 
limited lo- metal chchitmg peptides such as histidine-tryptopiian modules that allow 




['1 t)102-3 1)1\ 

punt'icatum s\sicni (Imnuinox C\)rp., Seattle, \VA). The inclusion c>l clea\able linker 
sequences such as those specific tor [-actor XA or enterokmase (hi\ itrogen, San Diego, CA) 
between the purification domain and IlCiS'l max he used to facihtale purification. One such 
expression \ cctor provides for expression of a fusion protein containing HCiS T and a nucleic 
5 acid encoding 6 histidme residues precedmg a thioredoxm or an enterokmase cleavage site. 
The hisiidine residues facilitate purification on IMIAC (immobili/ed metal ion affinity 
chromatography), as described m Porath. J. et al. ( 1992: Prot. li\p. Punf. 3: 263-281 ) while 
the enterokmase cleavage site provides a means for purifying HGST from the fusion protein. 
A discussion of vectors which contain fusion proteins is provided in Kroli, D.J. ct al. ( 1993: 

10 DNACell BioL 12:441-453). 

1[) addition U) lecuiiibiiiani pioducuon, iiagmeiils Oi HGST iViay bc (:>roduced by direct 
peptide synthesis using solid-phase techniques (Merrifield J. ( 1963) J, Am. Chem. Soc. 
85:2149-2154). Protein synthesis may be performed using manual techniques or by 
automation. Automated synthesis may be achieved, for example, using the Applied 

15 Biosystems 43 1 A peptide synthesizer (Pcrkin Elmer). Various fragments of HGST may be 
chemically synthesized separately and combined using chemical methods to produce the full 
length molecule. 



THERAPEUTICS 

20 HGST bears chemical and structural homology to the human Alpha GSTs Al-I and 

pGTH2. Al-I has specifically been linked to both the activation and deactivation of various 
chemical mutagens or carcinogens as well as to the deveiopmcnt of multi-drug resistance 
(MDR) in cancer. Accordingly llCiS T appears to function \n carcinogenesis b\ activating 
certain mutagens or carcinogens uhilc deaclixating others. IKiS 1 also appcai s lo t unction m 

25 the development ol MDR m cancer. These ma> include, but are not limited to, canceis of the 

bladder, prostate, lung, brain, spmal, ganghal, liver, colon, skin, testis, utertis, kidney, and 
breast. 

1 hetcfoie, m one embodiment. 1 K iS I oi a traiiment or den v alive ihcrcot. mav be 



a mutagen or carciiu^gcn is aflaloxin Bl. 

In another enibodu'nent, a \ eetor capable of expressniL! fiCjS I , or a tragment or a 
deri\ati\e thereof, ma\ also be admmislered to a subject to prevent nnitagencsis cm" 
carcmogenesis caused by a mutagen or carcinogen that has been shoun to be deactivated b\ 
S HGS 1. 

In another embodiment, a vector expressing antisense ot the polynucleotide encoding 
HGS r may be admmislered to a subjecl to prevent mutagenesis or carcinogenesis caused by a 
mutagen or carcinogen that has been show n to be activated by HGST. An example of such a 
mutagen or carcinogen is ethylene dibix)mide. 
10 In another embodiment of the invention, antagonists or inhibitors of HGST may be 

administered to a subjeci to prevent muiagencsis or carcinogenesis following exposure to a 
mutagen or carcinogen that has been shown to be activated by HGST. Antagonists or 
inhibitors of HGST may also be used to prevent or levcrse resistance to a given therapeutic 
agent used to treat cancer. In this embodiment, HGST inhibitors arc co-administered with the 
15 therapeutic agent in order to prevent HGST from inactivating the therapeutic agent. The 

preferred antagonist will bind and inactivate HGST as well as other isoforms of GST at the 
specific binding site of the therapeutic agent. 

In another embodiment, HGST may be administered in combination with other 
conventional chemotheiapeutic agents. The combination of therapeutic agents having 
20 different mechanisms of action will have synergistic effects allowing for the use of lower 
effective doses of each agent and lessening side effects. 

In another emh(^diment of the invention, treatment of a patient's blood w ith HGST 
max be used to remov e a circulating chemical toxin, u here there has been an acute exposure 
lo the toxm. The patient's blood is subjected to plasmapheresis in which the [plasma is passed 
25 through a mmi-filtration column containing stabili/cd HCiS 1\ and the toxin is rcmt>\ed b\ 

binding to the HGS^r before the plasma is returned to the patient. 

Antagonists or inhibitors of HCiS1' max be produced using methods xx hich are 
general Ix known in the ait. In particular, purified 1 1( iS I max be used to produce antibodies 




AntibtKiics which arc specific lor liCiS T ina\ be used directly as an antagonist, or 
mdirecllv as a targeting or deHxerx mechanism for bringing a pharmaccLilical agent to cells or 
tissue which express HGST. Vhc antibodies ma> be generated using methods that are well 
known m the art. Such antibodies may include, but are not limited to, pol\clonaL 
5 monoclonal, chimeric, single chain. Fab fragments, and fragments produced by a hab 

expression library. Neutrah/ing antibodies, (i.e., those which inhibit dimer formation) are 
especially prefeired for therapeutic use. The antibodies may be generated using methods that 
are well known in the art. Such antibodies may include, but are not limited to. polyclonal, 
monoclonal, chimeric, single chain. Fab fragments, and fragments produced by a Fab 

10 expression library. Neutralizing antibodies, (i.e., those w hich inhibit dimer formation) arc 
espccialiy prefened for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, lats, mice, 
humans, and others, may be immunized by injection with HGST or any fragment or 
oligopeptide thereot w^hich has immunogenic properties. Depending on the host species, 

15 various adjuvants may be used to increase immunological lesponse. Such adjuvants include, 
but aie not limited to, Freund s, mineral gels such as aluminum hydroxide, and surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemoeyanin, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli 
Calmette-Guerin) and Corynebactcri um parvum arc especially picfcrable. 

20 It is prefened that the peptides, fragments, or oligopeptides used to induce antibodies 

to HGST have an amino acid sequence consisting of at least five amino acids, and more 
preferably at least 10 amino acids. It is also preferable that they are identical to a portion of 
the ammo acid set|ucncc of the natural protein, and the\ ma\ coniam the entire amino acid 
sctjucncc ot a small, natural 1\ occ uiring molecule. Short strciclics of 1 1( iS 1 amino acids ma\ 

23 be fused with those of anotlicr piotcm such as kc\holc limpei hemoc\anin and antibod\ 

produced against the chimeric molecule. 

Monoclonal antibodies to IIGS T ma\ be prcj^arcd using an\ tcchnuiuc w hich prov ides 
for the production of antibody molecules b\ continuous cell lines in cullurc. These include. 
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al. ( 1^)85) J. Immunol. Methods 81 :3 1-42: (\nc. R.J. el al. ( 1983) Proe. Nail. Aead. Sei. 
8():2()26-2()3(): C\)le. S.P. el al. ( 1*^)84) Mt>!. Cell Biol. 62: 1 09- 1 20 ). 

In addition, teehniqiics de\ek^ped tor the prtKiuetion of "ehimene antibodies", the 
splieniL! of mouse antibody genes to human antibody genes to obtain a moleeule w ith 
5 appropriate antigen speeificity and biological aeti\ ity can be used (Morrison, S.L. et al. 
(1984) Proe. Natl. Aead. Sei. 81:6851-6855; Neubergei, M.S. et al. (1984) Nature 
312:604-608; Takeda, S. et al. (1985) Nature 314:452-454). Alternatively, techniques 
described for the production of smgle chain antibodies may be adapted, using methods known 
in the art, to produce HGS'^r-specitlc single chain antibodies. Antibodies with related 
10 specificity, but of distinct idiotypic composition, may be generated by chain shuffling from 
random combinatorial immunogiobulin libraries (Burion D.R. (Ivvl) Proe. Nail. Acad. Sci. 
88:11120-3). 

Antibodies may also be produced by inducing m vivo production in the lymphocyte 
population or by screening recombinant immunoglobulin libraries or panels of highly specific 

15 binding reagents as disclosed in the literature (Orlandi, R. et al. (1989) Proe. Natl. Acad. Sci. 
86: 3833-3837; Winter, G, et al. (1991) Nature 349:293-299). 

Antibody fragments which contain specific binding sites for HOST may also be 
generated- For example, such fragments include, but arc not limited to, the F(ab')2 fragments 
w hich can be produced by pepsin digestion of the antibody molecule and the Fab fragments 

20 which can be generated by reducing the disulfide bridges of the F(ab')2 fragments. 

Altcmati vely. Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fixh fragments with the desired specificity (Huse. W.D. et al. 
( 1989) Science 254: 1 275- 1 28 1 ). 

\ anoLis immuntKissax s ma\ be used lor scivennii: lo Klcntif\ antihodiCN ha\]iiy ihc 

25 desired spccificitx. Numerous protocols for compclili\c binding or i mmuiu>iadiomciric 

assa\s using either polyclonal or montK'lonal antibodies w ith established specificities arc well 
knt>\\n m the art. Such immunoassa\s t\picall\ nnoUc the measurement of complex 
formation between IKiS I and its specific antibodx. A two-silc, monoclonaldxiscd 
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In another embodiment of the nnention. the polxnueleotides eneoding HCiS T. or anv 
fragment thereof, or antisense molecules. ma\ be used for therapeutic pinposes. In one 
aspect, antisense to the poKnucleottde encoding HCiS l may be used m situations m w hich it 
would be desnable to block the transcription of the mRNA. In particular, cells ma\ be 
S transformed with sequences complementary to polynucleotides encoding HCjS r. Thus, 

antisense molecules may be used to modulate HGST actix ity, or to achieve regulation of gene 
function. Such technology is now well known in the art, and sense or antisense oligomers or 
larger fragments, can be designed from various locations along the coding or control regions 
of sequences encoding HGS I\ 

Expression vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, 

i\r i rf^m \ / t-i / \ i i K < wt i=i»'t o 1 t-v 1 . w i ^-J . ^ . i » , K,-. . . ,^ ^ ^ J i"^^*- ^ 1 : , . ,^ -i" „ . . i : ,J _ — _ _ _ . . . t 

v.. ..v.... .c*i.v/^*.j i.ct^^n^i icii picioiim^o Li.-)v^^ u^^iivti_y ui iim^icuiiuc ^c^ut:lHJCS lo uie 

targeted oi-gan, tissue or cell population. Methods which arc well known lo those skilled in 
the an can be used lo construct recombinant vectors which will express antisense molecules 
complementary to the polynucleotides of the gene encoding HGST. These techniques are 

15 described both in Sambrook el al. (supra) and in Ausubel et al. (supra). 

Genes encoding HGST can be tumed off by transformmg a cell or tissue with 
expression vectors which express high levels of a polynucleotide or fragment thereof which 
encodes HGST. Such constructs may be used to introduce untranslatable sense or antisense 
sequences into a cell. Even in the absence of integration into the DNA, such vectors may 

20 continue lo transcribe RNA molecules until they arc disabled by endogenous nucleases. 

Transient expression may last for a month or more with a non-replicating vector and even 
longer if appi-opriate replication elements are part of the vector svslem. 

As mentioned aboxe. modifications of gene expression can be obtained b\ desiiinini: 
antisense molecules. DNA, RNA. or PNA. to the control regions of tlic gene cncodiii<^ I ICiS 1. 

--■^ 1-^^- tti^^ pii)motc!s. enhanceis, and mlrons. Oligonucleotides derived from the transcription 

initiation site, e.g., between positions -10 and +10 (vom the start sUe. are preferred. Similarlv, 
inhibition can be achieved using "triple IkMix" base-pairing melhodologw Triple helix pairing 
Is useful becaLisc it causes inhibition of the abililv ot the double helix to t)pcn suflicicntlv for 
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I lubcr, B.l:. and B.I. (\ut. Molecular and ImnnmoloL^ic Approaches . iHilura Publishing Co., 
Ml. Kisct>. N\'}. I^hc antiscnsc molecules ma\ also be designed to block Uanslalion ot" 
niRNA b\ pre\enling the transcript from hnuimg to ri bo somes. 

Ribo/Ames, en/\matic RNA molecules, may also be used to catal>/e the specific 
3 ciea\ age of RNA. I he mechanism ot" ribo/yme action mvoKes seciuence-speciric 

hybridization of the nbo/yme molecule to complementary target RNA, tolkn\ed by 
endonucleol\tic clea\age. [Examples which may be used include engineered hammerhead 
motif ribo/yme molecules that can specifically and efficiently cataly/e endonucleolylic 
cleax age of sequences encoding HGS^r. 

10 Specific nbo/yme cleavage sites w ithin any potential RNA target are initially 

i d e n t i t i 0 d h y sc a n n i p g t h e t urge t m. o ! c c u ! c (o r rib o/ y m. e c ! e a v age s 1 1 e s w h i c h i n c 1 u d c the 
following sequences: GUA, GUU, and GIJC. Once identified, short RNA sequences of 
between 13 and 20 ribonucleotides corresponding to the region of the target gene containing 
the cleavage site may be evaluated for secondary structui"al featui"cs which may render the 

15 oligonucleotide inoperable. The suitability of candidate targets may also be evaluated by 

testing accessibility to hybridization with complcmentai"y oligonucleotides using iMbonucleasc 
protection assays. 

Antisense molecules and ribo/ymes of the invention may be prepared by any method 
known m the art for the synthesis of nucleic acid molecules. These include techniques for 

20 chemically synthcsi/mg oligonucleotides such as solid phase phosphoramidite chemical 
synthesis. Alternatively, RNA molecules may be generated by m vitro and m vivo 
transcription of DNA sequences enc(^ding FlCiS r. Such DNA sequences ma\ be incorporated 
into a wide \ arietx ot \ ectors with suitable RNW poKnierasc pr(Mi^oters such as 17 orSPd, 
.Mlcriiatix cl\ . these cDNA constructs that s_\nthcsi/c aniiscnsc RNA constituti \ cl \ or 

2"^ nulucibh can be introduced wMo cell lines, cclK. <m" tissues. 

RNA molecules may be modified to increase intracelltilar stabilitx and half-hie. 
Possible mcKlit ications include, but are not limited to. the addition of flanking seciuences at 
the 5' and/or ends ot the molecule ot the use ot phosplioroth unite or 2' i )-mclli \ 1 rather than 




nontraditional bases such as mosmc, ciueostne, and uyhulosinc, as well as acetyl-, mcth\l-, 
thu) , and siniilarl\ modified lorms of adenine, evtidme, euanine, llnmine, and undine w hich 
are not as easily recogni/ed by endogenous endonuc leases. 

\lan> methods lor introducing vectors into cells or tissues are available and equally 
S suitable for use m vivo , m vitro, and e\ vryo. For c\ vi\ o therapy, \ ectors may be introduced 
into stem cells taken from the patient and clonal ly propagated lor autologous transplant back 
into that same patient. Delivery by transfeclion and by liposome injections may be achieved 
using methods which arc well known in the art. 

Any of the therapeutic methods described above may be applied to any suitable 

10 subject including, for example, mammals such as dogs, cats, cow s, horses, rabbits, monkeys, 
and most preferably, humans. 

An additional embodiment of the invention relates to the administration of a 
pharmaceutical composition, in conjunction with a pharmaceutically acceptable earner, for 
any of the therapeutic effects discussed above. Such pharmaceutical compositions may 

15 consist of HGST, antibodies to HGST, mimetics, agonists, antagonists, or inhibitors of 

HOST. The compositions may be administered alone or in combination with at least one 
other agent, such as stabilizing compound, which may be administered in any sterile, 
biocompatible pharmaceutical carrier, including, but not limited to, saline, buffered saline, 
dextiose. and water. The compositions may be administered to a patient alone, or in 

20 combination w ilh other agents, drugs or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by 
any number of routes including, but not limited to, on\], intravenous, intramuscular, 
intra-arterial. intramedullary, iiUrathecak intrax entricular, transdermal, subcutaneous. 
intraixMitoneal. intranasal, enteral, toj^ical. sublingual, or ivctal means. 

2."^ In addition to the active ingredients, these pharmaceutical compositions nia\ contain 

suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which 
tacilitate processing of the actixe compounds mtt^ preparations w hich can be used 
pharmaccLiticallv . iHirther details on technujues for lormulaiion and aelministration ma\ be 




Pharmaceutical compositions lor oral administration can he formulated using 
pharmaceuiically acceptable carriers well know n m the art m dosa^ies suitable for oral 
administration. Such carriers enable the pharmaceutical compt>sitions to be formulated as 
tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for 
3 ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combination of 
active compounds with solid excipient, optionally grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain 
tablets or dragee cores. Suitable exeipients are carbohydrate or protein fillers, such as sugars, 

10 including lactose, sucrose, mannitol, or sorbitol; starch from com. wheat, rice, potato, or other 
[)ianLs; cellulose, such as meiliyl celiulose. riydroxypropyhiieuiyl-ceilulose, or sodium 
carboxymethylcellulose: gums including arabic and tragacanth: and proteins such as gelatin 
and collagen. If desired, disintegrating or solubilizing agents may be added, such as the 
cioss-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt thereof, such as sodium 

15 alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated 
sugar solutions, which may also contain gum arabic, talc, polyvinylpynolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents 
or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for 
20 product identification or to characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used omlly include push-fit capsules made 
of gelatin, as w ell as soft, sealed capsules made of gelatin and a ccniting, such as glycerol or 
sorbitol. Push-fit capsules can contain acli\e ingredients mixed w ith a Idler or binders, such 
as lactose or staiclics. lubiicanls. such as talc or magnesium stearatc. and. optionalK. 
2^ stabih/cis. In solt capsules, llie active compounds ma> be dissolved or suspendcLl in suitable 

hciuids. such as fatty cmIs, licjuid, or liquid polyethy lene glycol w ith or w ithcnit stabih/ers. 

Pharmaceutical tormulatuMis suitable lor parenteral administration ma\ be formulated 
in aijueous solutions, preferably m phvsiologicalh compatible bufters such as liankss 
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carho\> meth\ 1 cellulose. sorbiloL or dexlran. AdditionalK . suspensions c^l the aeti\e 
eompcHinds ma\ be prepared as appropriate oil\ injection suspensions. Suitable lipophilic 
solvents or \ehicles include tatty oils such as sesame oiL or sxnthetic tatly acid esters, such as 
ethyl oleate or trigKcerides. or liposomes. Optionally, the suspension may also contain 
3 suitable stabilizers or agents w hich increase the solubility of the compounds to allow tor the 

pix^paration of highly concentrated solutions. 

I^\)r topical or nasal administration, penetrants appropriate to the particular barrier t(^ 
be permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a 
10 manner thai is known in the ail, e.g.. by means of conventional mixing, dissolving, 
gi ariulatiiig, dragce-rriakmg, Icvigatirig, emuisiiymg. encapsulaiing. enii a[)[)ing, or 
lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed w ith 
many acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, 
15 succinic, etc. Salts tend to be more soluble in aqueous or other protonic solvents than are the 
con-csponding free base forms. In other cases, the preferred preparation may be a lyophilized 
powder w hich may contain any or all of the following: 1-50 mM histidine, 0. 1 7r-27r sucrose, 
and 2-79^ mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
20 appropriate container and labeled for treatment of an indicated condition. For administration 
of HGST, such labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions 
\\ herein the activ e ingredients are contained m an effective amount to achieve the intended 
puipose. I he determination of an ettcclive dose is well v\ ithm the capabihlv of tliose skilled 
2> m the art. 

\hM any cc^npound, the therapeutically effective dc^se can be estimated initially either 
m cell culture assavs. e.g.. of neoplastic cells, or in animal models, usuallv mice, rabbits, 
dogs, or pigs. I he animal model mav alst) be used to determine the appropriate concentration 
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A therapeutically elTeein e dose reters u> that amount t)t" acti\ e mgredient, lor example 
HCiS r or fragments thereof, antibodies of HCiS I\ agonists, antagonists or inhibitors of 
llCiST. w hich ameliorates the s\ niptoms or condition. I herapetitic efficac\ and toxicitx ma\ 
be determined by standard pharmaceutical procedures m cell cultures or experimental 
3 animals, e.g., HD5() (the dose therapeutically effective in 30^^ of the population) and I.D5() 

(the dose lethal to 50^^ of the population). The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the ratio LD5()/ED5(). Pharmaceutical 
compositions which exhibit large therapeutic indices are preferred. The data obtained from 
cell culture assays and animal studies is used in formulating a range of dosage for human use. 
10 The dosage contained in such compositions is preferably w ithin a range of circulating 

conceuLraLioiis thai include the EDfiO with liule or no ioxiciLV. The dosage vanes wiLliin litis 
range depending upon the dosage form employed, sensitivity of the patient, and the route of 
administration. 

The exact dosage will be determined by the practitioner, in light of factors related to 
15 the subject that requires treatment. Dosage and administration are adjusted to provide 

sufficient levels of the active moiety or to maintain the desired effect. Factors which may be 
taken into account include the severity of the disease state, general health of the subject, age, 
weight, and gender of the subject, diet, time and frequency of administration, drug 
eombination(s), reaction sensitivities, and tolerance/response to therapy. Long-acting 
20 pharmaceutical compositions may be administered every 3 to 4 days, every week, or once 
every two weeks depending on half-life and cleai"ance t"ate of the particular formulation. 

Normal dosage amounts may vary from OT to 100,000 micrograms, up to a total dose 
of about 1 g, depending upon the route of administration, (iuidance as to pai'ticular dosages 
and methods ot deli\ er\ is pid\ ided in the hleraUiic anil gcncia!l\ a\ailablc to praclitionci s m 
2."^ ihc ait. Those skilled in the art will employ different formulations lor nucleotides than for 

pixneins or their mhibitc^s. Similarly, delivery oi polynucleotides or polypeptides w ill be 
specific to particular ceils. c(Miditions, locations, etc. 
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diagnosis of conditions or diseases eharaclen/ed b\ expression of FiCiS 1, or in assa\ s to 
monitor patients being treated witli HCiS 1. agonists. antagt>nists or inhibitors. Tiie antibodies 
Liselul ior diagnostic purposes may be prepared m the same manner as those described abo\e 
for therapcLities. Diagnostic assays tor HCiS T include methods which utdi/e the antibodx and 
3 a label to detect HGS1^ in human body fluids or extracts of cells or tissues, rhe antibodies 
may be used w ith or w uhout modification, and may be labeled by joining them, either 
covalentiy or non-covalently, w ith a reporter molecule. A w ide variety of reporter molecules 
w hich are known in the art may be used, several of which are described above. 

A variety of protocols including ELISA, RIA, and FACS for measuring HOST arc 
10 known in the art and provide a basis for diagnosing altered or abnormal levels of HGST 

: M 1 ,J J . . 1 „ V T i/^ : _ „ I- 1 : _ 1 .J 1^ - . 
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body fluids or cell extracts taken from normal mammalian subjects, preferably human, with 
antibody to HGST under conditions suitable for complex formation. The amount of standaid 
complex fonnation may be quantified by various methods, preferably by photometric means. 

15 Quantities of HGST expressed in subject samples, control and disease, from biopsied tissues 
are compared with the standard values. Deviation between standard and subject values 
establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding HGST may be 
used for diagnostic purposes. The polynucleotides which may be used include 

20 oligonucleotide sequences, antisense RNA and DNA molecules, and PNAs. The 

polynucleotides may be used to detect and quantitate gene expression in biopsied tissues in 
which expression of HGST may be correlated w ith disease. The diagnostic assay may he 
used to distinguish between absence, presence, and excess expression of IKiS'f. and to 
iiioniior regulation of IIGS'I lc\els during therapeutic intcr\ cntion. 

2."^ In one aspect. h\ biidi/at ion with Vi R pi"obcs which arc capable ot detecting 

polynucle(nide sequences, including genomic sequences, encoding HCjST or ck>sely relateti 
molecules. ma\ be used to identifv nucleic acid scciuences which encode HCiST. 'The 
spcciticil\ ot the probe, whelhei" it is made \\'om a highK specific region, e.g.. M) unu]uc 
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intermediate, ov low ) will determine whether the probe identities only natural 1\ oeeiirring 
sec]uenees eneodmg IIGS T. alleles, or related sequenees. 

Probes may also be used lor the deteetion of related sequenees. and should preterablv 
eontam at least S()9f of the nueleotides from any of the FKiST eneoding sequenees. 1 he 
5 hybridi/ation probes of the subjeet invention may be DNA or RNA and derived from the 
nueleotide sequenee of SHQ ID NO:2 or from genomie sequenee ineluding promoter, 
enhaneer elements, and mtrons of the naturally oeeurring MGS r. 

Means for produeing speeifie hybiidization probes for DNAs eneoding FiGST inelude 
the eloning of nueleie aeid sequences encoding HGST or HGST derivatives into vectors for 
10 the production of mRNA probes. Such vectors are know n in the art. commercially available, 
and may be used to synthesize RNA probes m vitro by means of the addition of the 
appropriate RNA polymerases and the appropriate labeled nucleotides. Hybridization probes 
may be labeled by a variety of reporter groups, for example, radionuclides such as 32P or 
35S, or enzymatic labels, such as alkaline phosphatase coupled to the probe via avidin/biolin 
15 coupling systems, and the like. 

Polynucleotide sequences encoding HGST may he used for the diagnosis of 
conditions or diseases which are associated with expression of HGST. Increased expression 
of HGST may be a useful predictive indicator of carcinogenesis or an indicator of potential 
MDR in cancerous tissues. The polynucleotide sequences encoding HGST may be used in 
20 Southetn or northem analysis, dot blot, or other membrane-based technologies; in PGR 

technologies: or in dip stick, pin, ELISA or chip assays utilizing fluids or tissues from patient 
biopsies to detect altered HGST expression. Such qualitative or quantitative methods are 
well know n in ihc art. 

In a particular aspect, the tuicleolidc sequences cncoelmg 1 1( IS 1 nia\ he useful in 
25 assays that delect activation or mduction of various cancers, particularlv lln)se mentioned 

above. Vhc nucleotide secitiences encoding HCkSl^ ma\' be labeled by standard methods, and 
added to a fluid ov tissue sample from a patient under conditions suitable for [he ft>rmation of 
hybridi/alion complexes. Alter a suitable mcubation peruKl. the sample is washed and the 



the nuclcolidc scc[uencos ha\ c hyhridi/cel \\ ilh nLicleolidc sequences in [he sample, and the 
presence of altered levels of nucleotide seciuences encoding HCiS I m the sample indicates th 
presence ot the associated disease. Such assays may also be used to e\ aluate the efficacy of a 
particular therapeutic treatment regimen m animal studies, in clinical trials, or in monitoring 
the treatment of an individual patient. 

hi order to provide a basis for the diagnosis of disease associated with expression of 
HGS^r, a normal or standard profile for expression is estabhshed. 1 his may be accomplished 
by combining body fknds or cell extracts taken from normal subjects, either animal or human 
with a sequence, or a fragment thereof, which encodes HGST, under conditions suitable for 
hybridization or amplification. Standard hybridization may be quantified by comparing the 
values obiaincd iVom normal subjects with those from an experiment where a known amount 
of a substantially pui'ified polynucleotide is used. Standard values obtained from normal 
samples may be compared with values obtained from samples from patients who are 
symptomatic tor disease. Deviation between standard and subject values is used to establish 
the presence of disease. 

Once disease is established and a treatment protocol is initiated, hybridization assays 
may be repeated on a regular basis to evaluate whether the level of expression in the patient 
begins to approximate that which is observed in the normal patient. The results obtained 
from successive assays may be used to shov/ the efficacy of treatment over a period ranging 
from several days to months. 

With respect to cancer, the presence of a relatively high amount of transcript in 
biopsied tissue from an indiv idual may indicate a predispositi(Mi for the development of the 
disease, or may j^rox idc a means tor tletectmg tlie disease pi ior to the aj^pearance of actual 
clmical svmptoms. A moiv definitive diagnosis ol this ivpc mav allow health piofcsMonaK t( 
emplov preventative measures or aggressive treatment earlier thercbv preventing the 
dcv eU^pment or further progression of the cancer. 

Additional diagnostic uses for (Migonucleotides designed from the sec]uenccs encoding 
IKiSI mav involve the use of PC/R. Such ol isomers mav be chemical 1\ svnthesi/ed. 




antisense (3'<-5" ). cmpk)\cd under i>ptimi/cd conditions ior idcniitication ot a spccitic L!cnc 
or condilion. Vhc same two oliLzomers, nested sets of oligomers, or e\ en a degenerate |^ooI o( 
oligomers ma\ he employed under less stringent conditions for detection and/or ciuantitation 
of closely related DNA or RNA sequences. 
5 Vlethods which may also he used to quantitate the expression of HGS T include 

radiolaheling or hiotinylatuig nucleotides, coamplification of a control nucleic acid, and 
standard curves onto which the experimental results are interpolated (Mclhy. P.C\ et al. 
(1993) J. hnmunol. Methods, 139:235-244; Duplaa. C. et al. (1993) Anal. Biochem. 
212:229-236). The speed of quantitation of multiple samples may he accelerated hy running 

10 the assay in an ELISA format where the oligomer of interest is presented in various dilutions 
and a speclropiiolomen ic t)r coioiiineLric response gives rapid quanliLalioTi. 

In another emhodiment of the invention, the nucleic acid sequences which encode 
HOST may also he used to generate hyhridization prohes which are useful for mapping the 
naturally occurring genomic sequence. The sequences may he mapped to a particular 

\5 chromosome or to a specific region of the chromosome using well known techniques . Such 
techniques include FISH, FACS, or artificial chromosome constructions, such as yeast 
artificial chromosomes, haeterial artificial chiomosomes, hacteiial PI constiuctions or single 
chromosome cDNA lihraries as reviewed in Price, CM. (1993) Blood Rev. 7:127-134, and 
Trask, BJ. (1991) Trends Genet. 7:149-154. 

20 FISH (as descrihed in Verma et al. ( 1988) Human Chromosomes : A Manual of Basic 

Techniques , Pergamon Press, New York, NY) may he conelated with other physical 
chromosome mapping techniques and genetic map data. Examples of genetic map data can 
he found m the 1994 (lenomc Issue ot Science (265: 19Slf). Correlation hetueen the location 
ot the gene encodmg llCiS l c^n a [^h\s]cal chromosomal map and a specitic disease , oi 

25 predisposition to a specitic disease. ma_\ helj:> delimit the leguHi ot DNA associated with tlial 

genetic disease. The nucleotide sequences of the suhject unention may he used to detect 
differences m gene sec|uences hetween nt^'mal. carrier, or aflected indi\ iduals. 

J_n situ h\hndi/ation ot cliromosomal j^reparations and ph\sical mapping leclinKjucs 
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species, such cis mouse, nia\ rexeai assoeialeLi markers e\eu it the number or arm ot a 
partieulai- human chromosome is not known. Neu sequences Cvin he assigned to 
chromosomal arms, or parts tliereot. h\ ph\sical mapping. This i^rtmdes \aluable 
int'ormatu)n tt> investigators searching lor disease genes using positional cloning or other gene 
5 discoxerv techniques. Once the disease or syndrome has been crudely !oeah/ed by genetic 

linkage to a particular genomic region, lor example, A T to 1 lq22-23 (Ciatti, R.A. el al. ( 1^)88) 
Nature 336:377-580), any seciuenccs mapping to that area may represent associated or 
regulatory genes for further mvestigation. The nucleotide sequence of the subject mvention 
may also be used to detect differences in the chromosomal location due to translocatu^n, 

10 inversion, etc. among normal, earner, or affected indi\ iduals. 

in anoiher embodimeni of ihc invention, HCiST, lis caiaiyiic or nnmunogenic 
fragments or oligopeptides thereof, can be used for screening libraries of compounds in any 
of a variety of drug screening techniques. The fragment employed in such screening may be 
free in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. 

15 The formation of binding complexes, between HGST and the agent being tested, may be 
measured. 

Another technique for drug screening which may be used provides for high throughput 
screening of compounds having suitable binding affinity to the protein of interest as described 
in published PCT application WO84/03564. In this method, as applied to HGST large 

20 numbers of different small test compounds are synthesi/ed on a solid substrate, such as 

plastic pins or some other surface. The test compounds are reacted with HGST, or fragments 
thereof, and washed. Bcuind HCkST is then detected b\ methods well known in the art. 
PunfiCLl IKiS'l can also he coaled diicclK ontt) plates for use m the aforemeniioncLl ilrug 
sciccnmg tcchmqiies. A llci'nati \ el\ . non-nculial i /i ng antibodies can be used to capUiic the 

25 pcj^lidc and immobih/e il on a sohd support. 

In another embtidiment, one ma\ use ct>mpetiti\e drug screening assa\s m w hich 
neutrah/ing antibcHiies cajxiblc of binding 1 K iST specificalU compete w ith a test con^pound 
iov buuhng 1 1( iS 1. In this manner, the antibodies can be used to Lletcct the presence ot an\ 
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used m chemical nuuagcnicil\ tests, such as the well know n Ames lest, to determine the rt^le 
of HCiS'I" m activation cm' deactiv ation o! specU'ic mutagens or carcmogens. These tests may 
then be used to screen Tor antagonists oi" HCkS T mediated actu atuMi of mulagcns/carcinogens 
w hich ma\ be used in the prev ention of carcinogenesis as prev iously discussed. 
5 In additional embodiments, the nucleotide sequences which encode HGS may be 

used in any molecular biology techniques that have yet to be developed, prov ided the new 
techniques relv on properties of nucleotide sequences that are currently known, including, but 
not limited to, such properties as the triplet genetic code and specillc base pair mteractions. 
rhe examples below are provided to illustrate the subject invention and are not 
10 included for the purpose of limiting the invention. 

EXAMPLES 

1 BEAD TU 104 cDNA Library Construction 

The BLADNOT05 cDNA library was constructed from microscopically normal 
15 bladder tissue; the BLADTLJT04 cDNA library w^as constructed from cancerous bladder 

tissue from the same donor. Both tissues were obtained from a 60-year-old Caucasian male 
following excision during a radical cystectomy of a grade 3 of 3 transitional cell caicinoma in 
the left bladder wall which extended through the museularis propria mto the perivascular fat. 
Carcinoma in-situ was identified in the dome and trigone. The distal urethral margin was 
20 negative for tumor, but the prostate showed adenfibromalous hyperplasia. The patient 

presented with dysuria and slowing of the urinary stream. In addition to the cystectomy, the 
patient underwent a prostatectomy and vasectomy. Patient had a history of atherosclerosis 
aiul tobacco abuse. I he follow mg substances were being taken bv the doncM-; v itamms C i<: 
1:. cloves, wormwood, black walnut hull, beta carotene, garlic, aiul pv cnodgcncal . bamilv 
23 hiNti)i v included diabetes. Ivpc 1. in both the molher and talhci , a malignant ncopla>m of the 

stomach for the father, and atherosclerosis in a sibling. 

The tro/en tissue was homogeni/ed and Ivsed using a l\>lvtron PT-3()()() homogeni/er 
(Hrinkmann instruments, W'estburv, N J ) m guanidinium isothiocvanate solution. I lie Ivsatc 
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The RNA was cxlraclcd with acid phent>l pH 4.7, precipitated using 0.3 M sodium acetate 
and 2.-'S xolumes orethanoL resuspended m RNAse-free \sater. and DNase treated at 37 C\ 
The RNA extraction was repeated with acid phenol pH 4.7 and precipitated with sodium 
acetate and ethanol as before. 1 he mRNA was then isolated using the OLICiO l liX kit 
S (QIAGEN, Inc.: Chatsworth. CA) and used to construct the cDNA library. 

The mRNA was handled according to the recommended protocols in the 
SUPERSCRlP r plasmid system lor cDNA synthesis and plasmid cloning (Cat. #18248-013, 
Gibco/BRL). A new plasmid was constructed using the following procedures: the 
commercial plasmid PSPORTl (Gibco/BRL) was digested with E-xoR I restriction en/yme 
10 (New England Biolabs. Bexerley, MA), the overhanging ends of the plasmid were tilled with 
Klenow enzyme (New England Biolabs) and 2'-ueoxyrRicieoiiue 5'-ti IpliospiiaLCS (dN TPs), 
and the intermediate plasmid was self-ligatcd and tiansformed into the bacterial host, E. coli 
strain JM109. 

Quantities of this intermediate plasmid were digested with Hind III restriction enzyme 
15 (New England Biolabs), the overhanging ends were filled with Klenow and dNTPs, and a 

10-mer linker of sequence 5',,.CGGAATTCCG...3' was phosphorylated and ligated onto the 

blunt ends. The product of the ligation reaction w as digested with EcoR I and self-ligated. 

Following transformation into JM109 host cells, plasmids designated pINCY were isolated 

and tested for the ability to incorporate cDNAs using Not I and EcoR I restriction enzymes. 
20 BLADNOT05 cDNAs were fractionated on a SEPHAROSE CL4B column (Cat. 

#275105-01, Pharmacia), and those cDNAs exceeding 400 bp were ligated into plNCY I. 

The plasmid pINCY 1 was subsequently transformed into D\\5a coriipetent cells (Cat. 

#182584)12, (ubco/BRL). 



25 il Isolation and Stqutncin^ ot cDN A Clones 

Plasmid 13NA was released trom the cells and purified using the R.E.A.L. PREP 96 
plasmid kit ((\»tak>g #26 1 73: (^)IAG1-N. Inc. ). I his kit enabled the simultaneous ixirification 
of 96 samples in a 96 well block using multi-channel reagent dispensers. I hc rcctMiimended 




25 mg/l. and liInccioI at 0.4^ r : 2 ) after inoculation, the cultures were incubated for hours 
and at the end of incubation, the cells w ere l\sed w uh 0.3 ml of l\sis buffer: and 3) folkn\ mg 
isopropanol precipitation, the plasmid DNA pellet was resuspended in 0.1 ml of distilled 
w ater. After the last step m the prcUocoL samples were transferred to a 96-well block for 
5 storage at 4 ' C 

Vhc cDNAs were sequenced by the method of Sanger et al. (1973, J. Mol. Biol . 
94:44 If), using a MK^ROLAB 2200 (Hamilton. Reno, NV) in combination with Peltier 
P'rC2()0 thermal cyclers (MJ Research, Watertown, MA) and Applied Biosystems 377 DNA 
sequencing systems. 

10 

iii Homoioj^y Searchinj^ of cONA Clones and rheir Deduced Proieins 

Each cDNA w as compared to sequences in GenBank using a search algorithm 
developed by Applied Biosystems and incoiporalcd into the INHERIT 670 sequence analysis 
system. In this algorithm. Pattern Specification Language (TRW Inc, Los Angeles, CA) was 

\5 used to determine legions of homology. The three parameters that determine how the 

sequence comparisons run were window size, window^ offset, and enor tolerance. Using a 
combination of these three parameters, the DNA database was searched for sequences 
containing regions of homology to the query sequence, and the appropriate sequences were 
scored with an initial value. Subsequently, these homologous regions were examined using 

20 dot matrix homology plots to distinguish regions of homology from chance matches. 

Smith-Waterman alignments were used to display the results of the homology search. 

Peptide and protein sequence hom(^logies were ascertained using the INHERIT 670 
sec|uence anal \ sis sxstem using the methods similar to those used m DNA se(.|uence 
liomologics. l^iUcrn Specification Language aiui paramcler windows were used to search 

23 protein databases foi sequences conlainmg regions of hcMnolog> which were scored with an 

initial \ alue. Dot-matrix homology plots were examined tc^ distingtiish regions of significant 
homolc^g) from chance matches. 

BLAS [ . w Inch stands for Basic Local Alignment Search Tool ( AltschuL S.h. ( 1993 ) 



I 1 
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amnK> acid scciulmiccs to determine seciLience similarit\ . Because oi the local nature o\' the 
ahgnnients, BLAS T is especialK Liseful m deterniining exact inatclies or m identil\inii 
homolc^gs. IM.AS r is useful I'ov matches \\ hich do not contain gaps. The t undamental unit of 
BLAS r algorithm output is the Hiuh-scoring Segment Pair (I ISP). 
3 An HSP consists of two sequence fragments of arbitrary hut ecjual lengths w hose 

ahgnment is locally maximal and for w hich the ahgnment score meets or exceeds a tht*esho!d 
or cutoff score set by the user. The BLAST approach is to look for FlSPs between a query 
sequence and a database sequence, to evaluate the statistical significance of any matches 
found, and to report only those matches w hich satisfy the user-selected thiTshold of 
10 significance. The parameter E establishes the statistically significant threshold for reporting 
database sequence matches. H is inteiy^rctcd as the upper bound of the expected frequency cm 
chance occuiTcnce of an HSP (or set of HSPs) within the context of the entire database 
search. Any database sequence whose match satisfies E is rcpoilcd in the program output. 

15 IV Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a transcript 
of a gene and involves the hybridization of a labeled nucleotide sequence to a membrane on 
which RNAs from a particular cell type or tissue have been bound (Sambrook et ah, supra). 

Analogous computer techniques using BLAST (AltschuL S.F. 1993 and 1990, supra) 
20 are used to search for identical or related molecules in nucleotide databases such as GenBank 
or the LIEESEQ database (Incyte Pharmaceuticals). This analysis is much faster than 
multiple, membrane-based hybridizations. In additi(Mi, the sensitivity of the computer search 
can be modified t(^ delcrminc whether an\ particLilar match is catcgori/ed as exact or 
homologous, 

25 1 he basis of the search is the j^roduct scoic which ts elcfincd as: 

^( sequence identitv x maximum BLAST score 

100 

I he product score takes into account botli the degree c^l similarit\ between two sccjuenccs and 




Lisuallx idenlificd by selecting ihcise which shcn\ product scores between 15 and 40. although 
lower scores mny identity related molecules. 

The results of northern anaK sis are reported as a list of libraries m w hich the 
transcript encoding HCiS I occurs. Abundance and percent abundance are also reported. 
5 Abundance dnrctly reflects the number of times a particular transcript is represented in a 

cDNA library, and percent abundance ts abundance divided by the total number of sequences 
examined in the cDNA library. 

V Extension of H(iS T-Encodin^ Polynucleotides to F ull Length or to Recover 
10 Regulatory Sequences 

r^. 11 t_ ii^f^T^ . ^:,J . /c T / \ 1 1 A x : r~\ . ^ \ : „ : 

run iciigiii I -ciicuuiiiu iiut^icic auiu sequence v^i-V "-^ inw.^ ; l^ u^c^a lu ucMi^ii 

oligonucleotide primers tor extending a partial nucleotide sequence to full length or for 
obtaining 5' or 3\ intron or other control sequences from genomic libiaries. One primer is 
synthesized to initiate extension in the antisense direction (XLR) and the other is synthesized 

15 to extend sequence in the sense direction (XLF). Primers are used to facilitate the extension 
of the known sequence '\^utward" generating amplicons containing new, unknown nucleotide 
sequence for the region of interest. The initial primers are designed from the cDNA using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be 22-30 
nucleotides in length, to have a GC content of 50% or more, and to anneal to the target 

20 sequence at temperatures about 68 "-72" C. Any stretch of nucleotides which would result in 
haii*pin structures and primer-primer dimeri/ations is avoided. 

The (anginal, selected cDNA libraries, or a human genomic library are used to extend 
the sccjucncc: the latter is most useful to obtain 5' upstream regions. If more extension is 
neccssaiA ordcMicel. aLklilional sets of piimcrs aic designed to tLiillicr extend the known 

25 icgion. 

B\ following the mstructuMTs for the XL-F^GR kit (Perkm Idmer) and thoroughly 
mixing the en/\me and reaction mix, high fidelitx amplification is obtained. Beginning with 
40 pmot ot each prmnci" anil the recommended concentrations ot all otfier components of the 
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Stop i 
Step 2 
Step 3 
Sicp 4 
Step 5 
Step 6 
Step 7 
Step 8 
Stop 9 
Stop 10 
Step I 1 
Step 12 
Step I 3 



94 C Un 1 mill (initial deiKittiration ) 



b5 C \ov 1 mm 

68 C tor 6 mm 

94 tor IS see 

()5 C tor 1 mm 

()8 C lor 7 mm 



Repeat step 4-6 tor 13 additional eyeies 



94 C lor 15 see 
65 C tor 1 min 



68 C lor 7: 1 5 min 

Repeat step 8-10 for 12 eyeies 

72 C tor 8 mm 

4 C (and holding) 



A 5- 10 u\ aliquot of the reaeiion mixture is analyzed by eieeirophorcsis on a low 
eoneentration (about 0.6-0.8%) agarose mmi-gel to determine whieh reaetions were 
sueeessful in extending the sequenec. Bands thought to eontain the largest produets are 
seleeted and removed from the gel. Further purifieation involves using a eommcreial gel 
extraction method such as QIAQUICK (QIAGEN Inc., Chatsworth, CA). After recovery of 
the DNA. Klenow enzyme is used to trim single-stranded, nucleotide overhangs creating 
blunt ends which facilitate religalion and cloning. 

After ethanol precipitation, the products arc redissolved in 13 u\ of ligation buffer, 
\u\ T4-DNA ligase ( \5 units) and lu\ T4 polynucleotide kinase are added, and the mixture is 
incubated at room temperature for 2-3 hours or overnight at 16 C. Competent coli cells 
(in 40 u\ of appropriate media) arc transf(M*med with 3 u\ of ligation mixture and cultured in 
80 i(\ of SOC medium (Sambroc^k el al., supra). After incubation for one hour at 37 C, the 
uht)le traiistormation mixture is plated on Luna Bertani (LB)-auar (Samhrook et a!., supra) 
et>ntaininL: 2x Tarb, 1 he tolloumg da\ . se\ eral colonies are randomly picked trom each plate 
and cultured in 1 50 .< \ ot liLjUid LB/2x C'arb medium i^laced m an mdi\ idual well ot an 
appropriate, commercial l\-a\ai table, stei"ile 96-\vell microtiter plate. 4"he following da\. 5 u\ 
of each overnight culture is tianstened int(^ a non-stenle 96-\\ell ]ilatc and after di lulion 1: 10 
\v itli w ater. 5.1 ot each sample is transterred into a Pi R arrax. 
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Liscd for the extension reaction are added to each well. Amplification is performed LismLi the 
tollow ini: conditions: 



Step 1 C lor 60 sec 

Step 2 94 for 20 sec 

Step 3 55 C for 30 sec 

Step 4 72 CMor9()sec 

Step 5 Repeat steps 2 4 lor an additional 29 cycles 

Step 6 72 C for 180 sec 

Step 7 4 C (and holding) 



Aliqiiots of the PCR reactions arc run on agarose geis together with molecular weig 
markers. The sizes of the PCR products are compared to the original partial cDN As, and 
appropriate clones arc selected, ligated into plasmid, and sequenced. 

VI Labeling and Use of Hybridization Probes 

Hybridization piobes derived from SEQ ID NO:2 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 
20 base-paiis, is specifically described, essentially the same procedure is used with larger 
cDNA fragments. Oligonucleotides are designed using state-of-the-art software such as 
OLIGO 4.06 softw are (National Biosciences), labeled by combining 50 pmol of each 
oligomer and 250 uC\ of [y-^"P] adenosine triphosphate (Amcrsham) and T4 polynucleotide 
kinase (DuPont NEN, Boston, MA). The labeled oligonucleotides arc substantially purified 
with SEPHADEX G-25 superfine resm column (Pharmacia & Upjohn). A portion containin 
10 counts per minute of each of the sense and anlisense oligonucleotides is used m a typical 
mcmlManc based h\ bridi/ation anal\ mn of human genomic DN A digested with one ot the 
lollowmg endonucleases { Asc 1. Hgi 11. b;co Rl, Pst I, Xba 1 . or ^^ u II: DuPont NP;N). 

1 he DNA fn^m each digest is fractionated on a 0.7 percent agarose gel and 
transferred to n\ion membranes (Nytran Plus. Schleicher cK: SchuelL Durham. Nil). 
1 1\ bndi/alion in carriCLi out lor lb ht>urs at 40 (\ l o rcmo\c noiispccitic signaK. biols aic 
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RochcsiLM'. N ^') IS expensed to the blots ov the bkns arc exposed m a Phosphorlmager cassette 
(Molecular Dxnaiiiics. Siinnwalc. C\Ak h\bndi/ation patterns are compared \ isuall\. 

V II Antisense Molecules 

5 Antisense molecules to the HGST-encoding sequence, or any part thereof, is used to 

inhibit iri \ i\ o or m \'itro expression of naturally occurring HGS1\ Although use of antisense 
oligonucleotides, comprising about 20 base-pairs, is specifically described, essentially the 
same procedure is used with larger cDNA fragments. An oligonucleotide based on the coding 
sequences of HGS r, as shown in Figures 1 A, IB, and IC, is used lo inhibit expression of 

10 naturally occurring HGST. The complementary oligonucleotide is designed from the most 
unique 3' sequence as shown in Figures lA, IB, and IC and used either Lo inhibit 
transcription by preventing promoter binding to the upstream nontianslated sequence or 
translation of an HGST-encoding transcript by preventing the ribosome from binding. Using 
an appropriate portion of the signal and 5' sequence of SEQ ID NO:2, an effective antisense 

15 oligonucleotide includes any 15-20 nucleotides spanning the region which translates into the 
signal or 5' coding sequence of the polypeptide as shown in Figures 1 A, IB, and IC. 

VIII Expression of HGST 

Expression of HGST is accomplished by subcloning the cDNAs into appropriate 
20 vectors and transforming the vectors into host cells. In this case, the cloning vector PSPORT, 
previously used for the generation of the cDNA library, is used to express HGST in E. coli . 
l^pstrcam of the cloning site, this vectcM- c(Mitains a promoter for R-galactosidase, followed by 
sec|ucnce containing the ammo-tcrmmal Met, and the subscciuent sc\cn residues of 
l.>^galactos]dasc. Immcelialclx following these eight icsulucs is a bacteriophage promoter 
25 usef ul for Uanscription and a linker containing a number of unKiue resii ictu>n sites. 

Induction t^f an isolated, transformed bacterial strain u ith WYG using standard 
methods produces a fusion protein w hich ctinsists of the first eight residues of 
L>-galactosidasc. about 5 to 15 residues o( linker, and the full length protein. The signal 
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l\ Demonstration of H(JS I' Activity 

HCiS r acli\ it\ IS determined h\ measunnL: the en/\me ealalwed ec)n}Ui:alK>n o( (iSH 
w ith I -chl()rc)-2,4-dinilr()ben/ene (DC'NB), a common substrate iov most CiS I s. is 
5 meuhated with 1 mM C13NB and 2.5 mVl GSH together m O.IM potassium phosphate bufter. 
pH 6.5, at 25 C. Vhc conjugation reaction is measured by the change in absorhance at 340 
nm using an ullra\iolet spectrophotometer. Hn/yme activity is proportional to the change in 
absorbance at 340 nm. 

10 X Production of HGST Specific Antibodies 

HGST thai is sunstaniialiy purified using PAGE ciecirophoiesis (Sambrook, supra), or 
other purification technicjues, is used to immunize rabbits and to produce antibodies using 
standard protocols. The amino acid sequence deduced from SEQ ID NO:2 is analyzed using 
DNASTAR software (DNASTAR Inc) to determine regions of high immunogenicity and a 

15 corresponding ohgopolypeptide is synthesized and used to raise antibodies by means known 
to those of skill in the ail. Selection of appropriate epitopes, such as those near the 
C-terminus or in hydrophilic regions, is described by Ausubel et al. (supra), and others. 

Typically, the oligopeptides are 15 residues in length, synthesized using an Applied 
Biosystems peptide synthesizer Model 431 A using fmoc -chemistry, and coupled to keyhole 

20 limpet hcmocyanin (KLH, Sigma, St. Louis, MO) by reaction with 

N-maleimtdobenzoyl-N~hydroxysuccinimide ester (MBS; Ausubel et ak, supra). Rabbits are 
immunized w ith the oligopeptide-KLM c(nnple\ in complete F^reund s adjuvant. The 
resulting anliseia are tested for antipeptide acli\il\, loi" example, b\ bmdmg the j^eptide to 
jMastic. l4ocknig with BS.A, reacting uith rabbil antiscia. washing, and reacting witli 

25 radioiodmatcd, goat anti-rabbit IgCi. 

\I l*urification of Naturally Occurring IKJS I I sin^ Specific Antibodies 

NaluralK occurring ov lectMiibmanl IKiS'l is substantialK purUied b\ i nimunoat t tnit\ 
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such as C^NBr-actnalcd SliPMAROSt: (Pharmacia Tpjohn). After the ccuiphiig, the rcsm 
IS blocked and washed acccMxlmg to the manufacturers mstructions. 

Media containing UCkS T is passed c)\er the immunoaffinitN column, and the column is 
washed under conditions that allow the preferential ahsorbance of HCiS I (e.g., high ionic 
strength buffers in the presence of detergent). The column is eluted under conditions that 
disrupt antibody/I iGST binding (e.g., a hulTer of pH 2-3 or a high concentration of a 
chaotrope, such as urea or thiocyanale ion), and HGS^I^ is collected. 

XII Identirication of Molecules Which Interact with IK;S I 

HGST or biologically active fragments thereof are labeled w ith '"T Bolton-Hunter 
reagent (Bolton ci ai. (iv73) Biochem. J. i33: 329). Candidate molecules previously anayeu 
in the wells of a multi-well plate are incubated with the labeled HOST, washed and any wells 
with labeled HGST complex are assayed. Data obtained using different concentrations of 
HGST are used to calculale values for the number, affinity, and association of HGST with the 
candidate molecules. 

XIII Plasmapheresis of a Patient's Blood to Remove Toxins 

HGST is used to purify a patient's blood after exposure to a chemical toxin. In this 
case, rather than administering a pharmaceutical composition of HGST to the patient, the 
toxin is removed from the blood by plasmapheresis in a manner similar to thai used for 
kidney dialysis. HGST is stabilized in a mini-filtration column and the patient's plasma is 
passed through the column where the toxin is remoxed by binding to HGST before the 
phisma is rclurncd to the patient. 

.All publications aiKl patents mentioncLl in the abtne spcct fic<ilion arc herein 
incorporated b\ icfciencc. V arious modifications and varialions ot the described method and 
s\stem of the nnention w ill be apparent to those skilled m the art w ithout departing from the 
scope and spirit of the nn ention. Althcuigh the un ention has been described in connection 
w ith specific jMvferied embodiments, it should be understood that the uucnlion as claimed 
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mcilecLilar hu)k)g\ or rckitcd ticlds arc inlcnded to be within the scope of the iolloum^ 
claims. 



